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ABSTRACT 

We develop empirical methods for modeling the galaxy population and populating cosmological N- 
body simulations with mock galaxies according to the observed properties of galaxies in survey data. 
We use these techniques to produce a new set of mock catalogs for the DEEP2 Galaxy Redshift Survey 
based on the output of the high-resolution Bolshoi simulation, as well as two other simulations with 
different cosmological parameters, all of which we release for public use. The mock-catalog creation 
technique uses subhalo abundance matching to assign galaxy luminosities to simulated dark-matter 
halos. It then adds color information to the resulting mock galaxies in a manner that depends on the 
local galaxy density, in order to reproduce the measured color-environment relation in the data. In 
the course of constructing the catalogs, we test various models for including scatter in the relation 
between halo mass and galaxy luminosity, within the abundance-matching framework. We find that 
there is no constant-scatter model that can simultaneously reproduce both the luminosity function 
and the autocorrelation function of DEEP2. This result has implications for galaxy-formation theory, 
and it restricts the range of contexts in which the mocks can be usefully applied. Nevertheless, careful 
comparisons show that our new mocks accurately reproduce a wide range of the other properties of 
the DEEP2 catalog, suggesting that they can be used to gain a detailed understanding of various 
selection effects in DEEP2. 

Subject headings: galaxies: evolution — galaxies: high-redshift — large-scale structure of the universe 
— galaxies: halos — dark matter 



1. INTRODUCTION 

The accurate interpretation of galaxy survey data re- 
quires a careful consideration of a wide range of bi- 
ases and incompleteness that can arise from the selec- 
tion of the surveyed galaxies. Galaxy selection most 
commonly occurs as a function of galaxy flux, color, 
and (particularly for spectroscopic surveys) local den- 
sity on the sky. Since these three properites are well 
known to be strongly correlated with one another (e.g. 
iHogg et all 120041 iCooper et all 120061 ). a selection algo- 
rithm that chooses galaxies based on one of them will 
also produce selection biases in the distributions of the 
others. Estimating the scale of secondary selection ef- 
fects like these requires a detailed understanding of the 
interplay between different galaxy properties; this be- 
comes more important and more complex when con- 
sidering surveys that cover a wide range in redshift, 
since the galaxy population may evolve substantially 
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over the lookback time of the survey. The use of mock 
(i.e., simulated) galaxy catalogs to understand and ac- 
count for selection effects has therefore been an impor- 
tant part of the analysis in nearly all modern galaxy 
surveys, particularly those probing redshifts of order 
unity , such as D EEP2 dDavis et alJl2003t iNewman et"all 
l20ia. VVDS (ILe Fevre et al.l I2005T). and zCOSMOS 
([Lilly et al.ll2009h . In this study, we develop a set of em- 
pirically based techniques for constructing mock galaxy 
catalogs for high-redshift surveys. We also produce a 
new set of catalogs for the DEEP 2 survey and make them 
available to the public, although our techniques should 
be generally applicable to any spectroscopic galaxy sur- 
vey. 

At present, the detailed astrophysics of galaxy forma- 
tion is not sufficiently well understood to directly simu- 
late the galaxy population from first principles. However, 
there is strong support, through many different lines of 
evidence, for a basic picture of galaxy formation which 
holds that (1) all galaxies are embedded in larger dark 
matter halos that comprise most of their mass, (2) above 
some minimum mass threshold, all dark matter halos 
(whether primary halos or subhalos within a primary) 
host a single galaxy at their centers, and (3) there is a 
tight relation between the optical luminosity of a galaxy 
and the mass of its dark matter host. This paradigm sug- 
gests an obvious approach to constructing mock galaxy 
catalogs by assigning a galaxy properties to each halo or 
subhalo in a dark-matter-only N-body simulation, using 
some prescription for mapping between galaxy proper- 
ties and (sub)halo properties. Techniques for convert- 
ing periodic N-body simulation volumes into the cone- 
shaped geometries typical of surveys, and for including 
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the effects of cosmic structure evolution at high redshift , 
are now well deve l oped (e.g.. lYan. White. & Coi]| l200l 
iBlaizot et al.l2005t iKitzbichler fe Whitdl2007t ). What re- 
mains is to specify a halo-galaxy connection that repro- 
duces the observed galaxy population with high accuracy. 

One would ideally like to have a physically well- 
motivated method for assigning galaxies to dark-matter 
halos, so numerous authors have constructed mocks us- 
ing semi-analytic models of galaxy form ation applied to 
the underlying N-body sim u lations fe.^..|Eke et al1l2004t 
IKitzbichler fc White! 120071: IHenriques et all 120121 ) . At 
present, however, most such models fail to accurately 
reproduce the color distribution of galaxies at redshifts 
near unity. An additional difficulty arises because, un- 
til quite recently, N-body simulations of cosmologically 
interesting volumes have not had sufficient particle num- 
bers to resolve the relatively low-mass halos and sub- 
halos that host the f aintes t galaxies in modern surveys. 
lYan. White, fc Coill (l200l (hereafter YWC) addressed 
these problems by taking an empirically based halo- 
model approach, using a conditional luminosity function 
to construct mock catalogs for the DEEP2 redshift sur- 
vey, populating massive halos with one central galaxy 
and a number of satellites depending on the halo mass, 
in such a way as to match the measured DEEP2 lumi- 
nosity and autocorrelation functions. Since the satellites 
had to be randomly assigned to dark matter particles in 
these halos, rather than to well-defined subhalos, this ap- 
proach likely mis-estimated the spatial profiles and veloc- 
ity distributions of galaxy groups and clusters. Given the 
simulations available at the time, however, even this ap- 
proach could only account for the relatively bright galax- 
ies probed by DEEP2 at z > 0.7; the fainter galaxy pop- 
ulation sampled at low z was not included. 

The YWC mocks also did not include informa- 
tion on galaxy pr operties besides luminosity, although 
IGerke et all (2007) were able to add color information to 
these mocks according to the measured relation between 
color and local galaxy density. Their approach to color 
assignment was inspired by the ADDGALS mock-catalog 
creation algorithm that was used to produce mock ca ta- 
logs for the Sloan D i gital Sk y Survey (|York et al~ll2000f l in 
e.g., iKoester et al.l (|2007al) . and which will be described 
in detail by Wechsler et al. (in preparation). Even with 
the addition of colors, these mocks were not sufficient 
to model the faint, low-redshift population probed by 
DEEP2 in one of its three observational fields. In addi- 
tion, the YWC mocks were based on N-body simulations 
whose underlying cosmological parameters are at vari- 
ance with the best fit to current data. For these reasons, 
we are motivated to improve upon the YWC efforts and 
construct new mock catalogs for the DEEP2 survey. 

Since the construction of the YWC mocks, improve- 
ments in both software and hardware have allowed for 
a substantial increase in the mass resolution of N-body 
simulations of a g iven volume. Most recently, the 
Bolsh oi simulation (|Klypin. Truiillo-Gomez. fc Primackl 
I2011D simulated the growth of cosmic structure in a cu- 
bical volume 250 comoving h _1 Mpc on a side, resolving 
halos and subhalos dow n to masses smaller than those of 
the Magellanic Clouds (|Busha et al.l 1201 it ) and using a 
set of cosmological parameters that is in good agreement 
with current constraints from a wide variety of data. The 



combination of a large volume and excellent mass reso- 
lution permit the construction of DEEP2 mock catalogs 
over the full redshift and luminosity range probed by the 
survey, except for a handful of very faint low-z dwarfs. 
Moreover, because Bolshoi resolves subhalos down to the 
required mass range for DEEP2, we will be able to as- 
sign galaxies to halos and subhalos individually, rather 
than taking a halo-model based approach, which should 
better reflect the phase-space distributions of galaxies in 
groups and clusters. 

Lacking a sufficiently accurate astrophysical model of 
galaxy formation, we carry on with a purely empirical 
approach to mock-catalog construction. A popular tech- 
nique for assigning galaxy luminosities, known as sub- 
halo abundance matching, has been demonstrated as a 
feasible means of reproducing both the galaxy luminos- 
ity function and the autocorrelation function at a vari- 
ety of redshifts, under certain cosmo logical assumptions 
(jConrov. Wechsler. fc Kravt sov 2006); we adopt this ap- 
proach here and explore its applicability in more detail. 
To add color information to the resulting mock catalogs, 
we expand o n the environment-dependent approach of 
IGerke et al.l (j2007l ). making substantial improvements to 
accurately model galaxies near the DEEP2 flux limit, as 
well as various luminosity and color-dependent sources of 
incompleteness in the survey. We also repeat the mock- 
making procedure for two other high-resolution N-body 
simulations with different cosmological parameters from 
Bolshoi, to facilitate tests for cosmological dependence 
in the DEEP2 selection function. 

We proceed as follows. The next section introduces the 
DEEP2 survey and the N-body simulations. Section [3] 
details our methods for assigning galaxy properties to 
halos and subhalos, while Section U describes the tech- 
niques we use to replicate various DEEP2 selection effects 
in the resulting catalogs. Section [5]is likely the most use- 
ful portion of the paper for users of these mock catalogs, 
since it makes detailed comparisons between the mock 
catalogs and the DEEP2 dataset. In the Appendix, we 
describe the contents of the public DEEP2 mock catalogs 
and give brief examples of their use. Throughout this pa- 
per, unless otherwise specified, distances are quoted in 
comoving hr x Mpc and absolute magnitude values are 
given as M — 5 log h. 

2. DATA AND SIMULATIONS 
2.1. The DEEP2 Galaxy Redshift Survey 

The DEEP2 (Deep Extragalactic Evolutionary Probe 
2) Galaxy Redshift Survey is a large spectroscopic sur- 
vey of galaxies at z ~ 1 comprising spectra of some 
50,000 objects and ~ 35,000 confirmed galaxy redshifts 
in four observational fields covering a total of ~ 3 deg 2 
on the sky. Complete details of the survey, including tar- 
get selection, spectroscopy, data reduction, and r edshif t 
assignment procedures appear in iNewman et al.l (2012); 
su bstantial discussion of th e se iss u es is also availabl e 
in iDavis. Gerke. fc Newman! (|2004l ) : iDavis et al.l (|2007f ). 
and iWillmer et al.l (|2006l ) (hereafter W06). Here, we 
summarize the properties of the survey that we will need 
to replicate in our mock catalogs. 

Spectroscopic targets for DEEP2 were selected from 
deep imaging with the C FH12k camera on the Canada- 
France-Hawaii telescope (jCoil et al.l 120041 ) in the £?, R 
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and I bands. Three of the four fields each comprise three 
CFH12k pointings, oriented along lines of constant dec- 
lination, making a 1 deg 2 contiguous field with a 2 deg 
x 0.5 deg aspect ratio. The fourth field is the 2 deg x 
0.25 deg Extended Groth Strip (ECS), which is oriented 
perpendicular to the ecliptic and has been the site of a 
wide variety of observations across the full range of the 
electromagnetic spectrum. 

The spectroscopic selection has an i?-band apparent 
magnitude limit of 24.1, and a selection in observed color- 
color space was also applied to exclude galaxies at red- 
shifts below z r-i 0.75, except in the ECS, where galaxies 
were observed throughout color space, but with a prefer- 
ence for galaxies expected to lie at z > 0.75. Spectra were 
obtai ned with the DEIMOS spectrograph ([Faber et al.l 
I2003T ) on the Keck II telescope, which uses custom-milled 
slitmasks to allow multiplexed slit spectroscopy of > 100 
objects simultaneously. The DEEP2 spectroscopic tar- 
geting algorithm tiles the DEEP2 fields with DEIMOS 
slitmasks in an overlapping pattern such that most of 
the suitable galaxies have at least two chances of being 
selected for spectroscopy (four chances in the ECS). Be- 
cause the total slit length available is limited, it is not 
possible to target all suitable galaxies for spectroscopy; 
the targeting efficiency of DEEP2 is ~ 60%. 

Data reduction and initial redshift estimation was 
perfo rmed using a software pipeline (|Newman et al.l 
|2012| ) that was specially designed for use with DEEP2 
DEIMOS data. Each spectrum was then examined by 
eye and the initial redshift estimate was either confirmed 
or corrected, or the spectrum was rejected as a faili- 
ure. Roughly 70% of DEEP2 spectra yield successful 
rcdshifts. We estimate, however, that ~ 15% of DEEP2 
targets lie beyond the DEEP2 target redshift range (C. 
Steidel 2003, private communication), at z > 1.4, where 
all useful spectral features for redshift identification have 
shifted out of the optical waveband. Thus, the redshift- 
success rate for DEEP2 galaxies in the appropriate red- 
shift range is ~ 85%; when combined with the targeting 
rate, this gives an overall spectroscopic sampling rate for 
DEEP2 of - 50%. 

2.2. N-Body Simulations 

We use dark matter halos from three N-body simu- 
lation s. Our principal results use the Bolshoi simula- 
tion ([Klvpin. Truiillo-Gomez. fc Primackl 120111) , which 
modeled a 250 hr 1 Mpc comoving box with fi m = 
0.27, n A = 0.73, ct 8 = 0.82, n = 0.95, and h = 
0.7. The simulation volume contained 2048 3 parti- 
cles, each with mass 1. 15 x 10 8 h~ 1 M P) , and was run 
using the ART code ()Kravtsov. Klvpin. fe Khokhlovl 
119971 ). Halos and subhalos were iden t ified using 
the BDM algorithm dKlvpin fc Holtzmanl 119971 ): see 
iKlvpin. Truiillo-Gomez' fc Primackl (|201 if ) for details. 
One unique aspect of this simulation is the high spatial 
resolution, which is resolved down to a physical scale of 
1 /i _1 kpc. This improves the tracking of halos as they 
merge with and are disrupted by larger objects, allowing 
them to be followed even as they pass near the core of the 
halo. The resulting halo catalog is nearly complete for 
objects down to a circular velocity of v max = 55 km s _1 . 
Merger tree s were created using the consistent merger 
tree code of iBehroozi et al.l (|201lD . 

The additional simulations used are described in Table 



[TJ These simulations were also run with the ART code 
and analyzed in the same way as Bolshoi; we primar- 
ily use them to test the impact of varying cosmological 
parameters on the properties of our mock catalogs. All 
results shown use the Bolshoi simulation, with the excep- 
tion of Figure 12, which compares the halo occupation 
distribution between the simulations. 

3. MOCK-CATALOG CREATION ALGORITHM 

A brief summary of our algorithm for constructing 
DEEP2 mock catalogs from the N-body simulations is 
as follows. We first construct redshift-space lightcones 
with the geometry of the DEEP 2 by defining an ob- 
server position and line-of-sight direction in the z — 
simulation output and then stacking different simulation 
outputs along the line of sight, making use of the boxes' 
periodic boundary conditions. We then assign monochro- 
matic (£>-band) galaxy luminosities to the dark matter 
halos and subhalos, according to the measured DEEP2 
luminosity function , using the subhalo abundance match- 
ing technique (e.g. iConrov. Wechsler. fc Kravtsovl 120061 : 
IBehroozi. Conrov. fc Wechslerll2010[) . and we account for 
the measured redshift evolution in this luminosity func- 
tion. We assign each mock galaxy a U — B color drawn 
from the DEEP2 survey according to its local density, 
by matching the color distribution in quintiles of local 
density between DEEP2 and the mock catalog, and we 
apply an inverse /c-correction to these galaxies to obtain 
an observed i?-band apparent magnitude for each one. 
Finally, we apply the various DEEP2 selection cuts, in- 
cluding density-dependent incompleteness arising from 
the scheduling of galaxies for DEIMOS spectroscopic ob- 
servation and color, luminosity, and redshift-dependent 
incompleteness arising from the failure to obtain reliable 
redshifts for some targets. We explain each of these steps 
in detail in the following sections. 

3.1. Constructing the lightcone 

To construct a realistic mock catalog, it is necessary 
to convert the real-space positions of halos and sub- 
halos in an N-body simulation into redshift-space po- 
sitions in a lightcone that conforms to the survey ge- 
ometry. To the extent possible, we would also like 
the positions at each redshift to correspond to the ap- 
propriate epoch of the simulation — so that, for exam- 
ple, mock galaxies at z = 1 are drawn from a snap- 
shot of the simulation at a scale factor a — 0.5. Since 
we would like our lightcones to represent the DEEP2 
survey volume as realistically as possible, we wish to 
avoid duplicating any region of the simulation volume 
in any one lightcone. Numerous authors have imple- 
mented lightcone-making for mock catalogs (e .g., YWC, 
iBlaizot et afl [20051 iKitzbichler fc Whitei r2007). Here, in 
the two smaller simulation boxes (120 and 160 Mpc), we 
closely follow the approach of YWC. In the Bolshoi sim- 
ulation, owing to the larger volume available, we take 
a somewhat more flexible approach. We outline these 
lightcone-construction algorithms briefly below. 

The basic idea is to choose an observer position within 
the simulation box, an observational field geometry, and 
a direction of observation. We can then trace a cone 
through the box, making use of the periodic boundary 
conditions of the simulation to "wrap" the lightcone and 
construct a line of sight that is significantly longer than 
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the nominal dimensions of the box. Once the cone is 
constructed, it is straighforward to convert the positions 
of halos and subhalos in the simulation box into RA, 
Dec and line-of-sight distance coordinates in the light- 
cone. We then convert the line-of-sight distance into a 
redshift for each object using the distance-redshift re- 
lation for the background cosmology of the mock and 
adding a peculiar- velocity contribution derived from the 
velocity component of each object along the chosen line of 
sight. In each simulation, we include halos and subhalos 
down to masses of a few times 10 9 M Q , which includes 
the mass range of the faint (Mb > 15) dwarf galaxies 
that DEEP2 probes at low redshift in the EGS. 

It is important to take some care in the choice of line- 
of-sight direction, angling it to ensure that the lightcone 
does not overlap itself on subsequent passes through the 
simulation box and repeatedly sample the same volume 
(see, e.g., the bottom panel in Figure 2 of iBlaizot et al.1 
2005). This also imposes a practical limit on the redshift 
range that a given lightcone can probe, which depends 
on the geometry of the observational field: a wide light- 
cone will quickly reach a comoving transverse size similar 
to the box size for example, making overlaps inevitable. 
Since DEEP2 is nearly a pencil-beam survey, however 
(the fields are ~ 2 degrees across in the longest dimen- 
sion), this is not a limitation for our purposes in practice. 

YWC used a simulation whose output timestep spac- 
ing corresponded to the light-travel time over roughly 
half of the simulation box dimensions. Their approach 
to constructing lightcones was to start with an observer 
position on one face of the simulation cube and transi- 
tion between timesteps each time the line of sight had 
traversed half of the box. It is then possible to make 
a second lightcone by translating the observer position 
halfway across the box in the direction perpendicular 
to the face that was chosen initially. This procedure 
can be repeated for each of the six box faces, allowing 
a maximum of twelve lightcones to be constructed. If 
more lightcones or a finer time-spacing are desired, it is 
possible to generalize this approach to start with a ran- 
domly selected observer location and sightline, subject 
to t he requirement that the lig htcone not overlap itself 
(e.g. IKitzbichler fc White! [2001 . 

How many lightcones should we produce for each sim- 
ulation? Since we would like to use our mock catalogs to 
estimate the sample variance in DEEP2, we would like 
our lightcones to probe independent (i.e., nonoverlap- 
ping) volumes as much as is possible. In the ideal case, 
all of our mock lightcones would contain completely in- 
dependent volumes, and so we should stop constructing 
lightcones once their total volume equals the volume of 
the simulation bojQ. For many purposes, however, it is 
sufficient if the mocks are independent only at fixed red- 
shift, or, more precisely, in redshift bins of the width that 
we are typically interested in considering in our analyses. 
That is, if we want to estimate the sample variance in 
bins of width Az = 0.1, then it is acceptable if mock A 
contains the same volume at z = 1 that mock B contains 
at z = 0.5. For this reason, we will tend to "overfill" our 

7 Neither of our algorithms absolutely guarantees that two dif- 
ferent lightcones will not overlap, of course, but since DEEP2 
is nearly a pencil-beam survey, any overlapping volume between 
mocks should be small. 



simulation boxes with mock catalogs by a factor of a 
few, to allow for better statistical power in estimating 
the sample variance in redshift bins. 

A single 1 deg 2 DEEP2 observational field contains 
a total volume of ~ 2 x 10 6 ft.~ 3 Mpc 3 over the primary 
DEEP2 redshift range of 0.75 < z < 1.4, and a redshift 
bin of width Az = 0.1 has a volume ~3x 10 5 /i _3 Mpc 3 . 
Thus, we can fit one complete DEEP 2 field into the LI 20 
simulation, and two into the L160 box, and on the or- 
der of ten independent Az ~ 0.1 redshift bins into each. 
We therefore use exactly the algorithm that was used in 
YWC to construct lightcones for these two boxes, pro- 
ducing twelve lightcones for each box. Given the total 
volume of the boxes, we expect that redshift bins of width 
0.125 will contain independent volumes across all twelve 
cones in the L160 box, and bins of width 0.05 will be 
independent across the L120 lightcones. These numbers 
are summarized in Table [T] In each case, wider bins will 
allow proportionally fewer independent cones. Both of 
these simulations have sufficient time resolution in their 
outputs that we can follow the YWC technique of transi- 
tioning between timesteps whenever the sightline has tra- 
versed half the box — except in the case of the L120 box 
at z > 1. In that regime, there are only a few simulation 
outputs covering the entire range from 1.0 < z < 1.4. 
For this reason, we advise against using the L120 box 
for applications that require an accurate representation 
of the redshift evolution of cosmic structure. 

The larger volume of the 250/i~ 1 Mpc Bolshoi simu- 
lation allows us to construct significantly more indepen- 
dent DEEP2 samples, with the volume of eight fully inde- 
pendent DEEP 2 fields (over the primary 0.75 < z < 1.4 
redshift range) available in Bolshoi, and scores of inde- 
pendent Az ~ 0.1 redshift bins available. We there- 
fore use the more flexible approach to lightcone-building 
described above, with randomly selected observer posi- 
tions and lines of sight, to construct forty DEEP2 light- 
cones from the Bolshoi simulation. These forty lightcones 
should be almost completely independent in redshift bins 
as wide as 0.15. The time resolution of Bolshoi is suffi- 
ciently good over the entire redshift range sampled to 
allow multiple timesteps to be used in each crossing of 
the box, so the growth of large-scale structure should be 
captured with sufficient accuracy for any practical pur- 
poses. 



3.2. Luminosity assignment by subhalo abundance 
matching 

One of our primary goals in constructing the mock cat- 
alogs presented here is to produce a simulated catalog 
appropriate for testing and optimizing algorithms to de- 
tect groups and clusters of galaxies in redshift space in 
DEEP2. We also wish to use these mocks to calibrate the 
halo-mass selection function that corresponds to the ob- 
servational selection of galaxy groups {i.e., systems with 
two or more observed members in DEEP2). Therefore, 
we want to ensure that our mock catalogs accurately re- 
produce both the number of observed galaxies in massive 
halos and their distribution in phase space, since redshift- 
space cluster-finding algorithms will be strongly sensitive 
to both of these. 
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Table 1 

Parameters of the three N-body simulations used to produce mock catalogs in this work. 



Simulation 


Box Dimension 




<?8 


h 


#of 


# independent 


Max. independent 




(h- 1 Mpc) 








lightcones 


DEEP2 fields 


z bin width a 


L120 


120 


0.3 


0.7 


0.73 


12 


1 


0.05 


L160 


160 


0.24 


0.9 


0.7 


12 


2 


0.15 


Bolshoi 


250 


0.27 


0.82 


0.7 


40 


8 


0.15 



a So that, for example the same redshift bin in the L160 lightcones samples a different region of space 
in all twelve cones. 



3.2.1. Subhalo abundance matching: rationale and basic 

approach 

In the language of the halo model, our first require- 
ment translates to accurately reproducing the halo oc- 
cupation probability, P(N\M), which is the probability 
that N galaxies brighter than some luminosity threshold 
dwell in a halo of mass M. This distribution is typi- 
cally represented by its first moment, N(M), typically 
called the halo occupation distribution (HO D), plus an 
assum ed form for the scatter in N (see, e.g.. iZheng et al.l 
2005). For a given background cosmology, a particular 
choice of HOD uniquely spec ifies the two-point auto cor- 
relation function of galaxies (jPeacock fc S mith 200(3). If 
we assume our simulation has the correct background 
cosmology, we can choose a prescription for populating it 
with mock galaxies and test the accuracy of the HOD by 
comparing its correlation function to the one measured 
in the DEEP2 data. YWC took this basic approach: 
they assigned galaxies to massive dark matter halos as 
a function of mass accordi ng to a conditional luminos- 
ity fu nction (CLF; see e.g. lYang. Mo. fc van den Boschl 
120031 ). 4>{L\M). It was possible to tune the CLF to re- 
produce the observed DEEP2 galaxy clustering and, once 
this was achieved, to have some confidence that the re- 
sulting HOD in the mocks was an accurate representation 
of the DEEP2 HOE@. 

However, the N-body simulations YWC used did not 
have sufficiently high resolution to include the low-mass 
subhalos that would be expected to host the faintest 
DEEP 2 galaxies in groups. Thus, instead of placing all 
galaxies in halos or subhalos, those authors assigned a 
single galaxy to the center of each halo and satellite 
galaxies to the positions and velocities of dark-matter 
particles selected at random from the halo. Since the 
distribution of galaxies within a cluster is unlikely to per- 
fectly trace the dark matter distribution, this approach 
will produce inaccuracies in the phase-space distribution 
of galaxies on small spatial scales. Put another way, the 
YWC mocks do not include the effects of luminosity- 
dependent galaxy bias (except to the extent that the 
brightest galaxy in each halo was placed at the center). 
This could have important implications for cluster find- 
ing if bright galaxies in clusters have a significantly differ- 
ent s patial or velocity distribution than faint ones. (In- 
deed, iCoil et all (|006aT ) showed that radial distribution of 

8 However, the comparison between the two-point functions of 
the mocks and the data was performed using early DEEP2 mea- 
surements with large uncertainties. More recent measurements of 
the DEEP2 clustering are not consistent with the YWC mocks, 
sharpening the need for a new set of catalogs. In addition, the 
background cosmology in the YWC mocks is no longer consistent 
with present data. 



galaxies in halos in the YWC mocks was not consistent 
with the DEEP2 data.) 

For this reason, we have been careful in this work 
to choose high-resolution N-body simulations in which 
the lowest-mass halos and subhalos have a number den- 
sity that matches or exceeds that of the faintest galax- 
ies observed in DEEP2. This will permit us to assign 
all galaxies directly to the centers of halos and subhalos 
and thus, presumably, to more accurately reproduce the 
phase-space distribution of DEEP2 galaxies. This also 
allows us to bypass the step of selecting a CLF or HOD 
and tuning it to match the DEEP2 correlation function. 
Instead, we can construct a direct relation between a 
galaxy's luminosity and the properties of its host halo 
or subhalo, choosing this relation in such a way as to 
reproduce the DEEP 2 clustering results. 

The simplest way to construct such a relation is to 
match the luminosity function of galaxies directly to the 
mass function of subhalos at fixed number density, which 
is the core of the abmdance matching technique. More 
precisely, we integrate the luminosity function to com- 
pute the number density of galaxies brighter than some 
luminosity, n(> L) = J L cj)(L)dL and compare it to the 
number density of halos and subhalos more massive than 
some mass, n[> M) (under a particular choice of halo- 
mass definition^- Setting n(> L) = n(> M) then yields 
an implicit relation between luminosity and mass, L(M) 
that must obtain if a galaxy's luminosity depends only 
on the mass of its host halo. In reality, although there is 
strong evidence for a tight relation between galaxy lumi- 
nosity and halo mass, there are numerous other factors 
that can affect galaxy luminosity at fixed mass, so some 
scatter in this relation is to be expected. In any event, a 
particular implicit mean L(M) relation and scatter about 
that relation will correspond to a particular HOD, and 
hence a particular galaxy autocorrelation function, since 
the subhalos of massive halos will be populated by more 
or fewer bright galaxies depending on the details of the 
chosen relation. If we assume that the basic picture of 
a tight L(M) relation with some scatter is correct, then 
we can extract this relation from the data by populating 
subhalos in a simulation according to such a relation and 
varying the details (e.g., the size of the scatter or the sub- 
halo mass definition) until we obtain an autocorrelation 
function in the simulation that matches the measured 
one; 

iConrov. Wcchslc r. fc Kravtsovl ()2006f ) showed that the 
basic features of the measured autocorrelation func- 

9 It is worth noting here that we always compute n(M) over the 
entire simulation box, rather than for the lightcones individually, to 
minimize the effects of sample variance on our abundance-matching 
procedure. 
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tion could be reproduced using this technique, both at 
low redshifts, from SDSS, and at high redshifts, from 
DEEP2, given two minor alterations to the basic al- 
gorithm described above. First, they used the maxi- 
mum circular velocity values, v max , of the halos, rather 
than their masses, to compute their number density 
function (v max is a more robust tracer of halo mass 
than the total mass of the particles in the halo, be- 
cause it traces the central part of the halo which is 
less sensitive to tidal stripping dKravtsov et al.l 20041 : 
iMuldrew. Pearce. fc Powerll2011t rKnebe et al 1120111 )). In 
addition, when a halo was in fact a subhalo of a more 
massive object, they assigned it the v max value it had 
at the time it was accreted into the larger halo, v^ ax , 
for the purposes of computing halo number density. 
That is, they used abundance matching to derive an im- 
plicit i(v^ ax ) relation. The choice of the accretion-time 
Vmax amounts roughly to assuming that satellite galax- 
ies falling into larger groups or clusters of galaxies are 
stripped of their outer dark matter halos but not stripped 
of stars, so that after they are accreted their luminosities 
are hi gher than the mean -L(v max ) re l ation for distinct 
halos. TConrov. Wechsler. fc Kravtsovl (|2006D found that 
ignoring this effect led to a too-low correleation function 
on small scales. 

3.2.2. Testing the impact of scatter 

Introducing scatter into the L(v^ x ) relation gener- 
ally has the effect of reducing the spatial correlation of 
the mock galaxies. This is because the mass function 
is steeply falling, so that there is always a much larger 
population of lower-mass halos than higher-mass ones for 
a given reference mass, so scatter in luminosity at fixed 
mass will include a large number of low-mass halos in the 
population brighter than a given luminosity, compared 
to the scatter-free case. Since halo bias (the ratio of the 
halo correlation function to the dark-matter one) is an 
increasing function of mass, including low-mass halos will 
suppress the clustering. Reddick et al (in preparation) 
found that the subhalo abundance matching approach 
yields the best match to the SDSS autocorrelation func- 
tion and conditional luminosity function if the L(v pea k) 
relation used has a scatter in luminosity at fixed halo cir- 
cular velocity (using the maximum value of this circular 
velocity over each halo's history, v pea k), with a scatter of 
~ 0.18 dex yielding the best results. 

To investigate the appropriate scatter to use for repro- 
ducing the DEEP2 autocorrelation function, we choose 
the output of the Bolshoi simulation at a = 0.528 (cor- 
responding to z = 0.9, near the peak of the DEEP2 red- 
shift distribution) and use abundance matching to popu- 
late its halos and subhalos with galaxies drawn from the 
z = 0.9 DEEP2 luminosity function measured by W06. 
We repeat this exercise with various different amounts 
of log-normal scatter in the implicit L(v m 3 ax ) relation, 
from to 0.6 dex; in each case the scatter is constant 
as a function of v^ ax . We then compute the projected 
correlation function, w p (r p ), as viewed along one axis of 
the box, and we compare with the measured values in 
DEEP2 (jCoil et alj|006b[ ). for galaxies brighter than a 
few different thresholds in luminosity. 

The scatter model is implemented by an itera- 
tive non-parametric forward convolu tion method, as in 
iBehroozi. Conrov. fe Wechsler (2010). Initial abundance 



matching with zero scatter is used to calculate a fiducial 
L(v m 3 ax ) relation. This relation is then convolved with 
the desired scatter model to generate fiducial halo lu- 
minosities. Abundance matching is repeated; instead of 
matching the brightest galaxy to the halo with the great- 
est v^iax however, the brightest galaxy is matched to the 
halo with the highest fiducial luminosity. This in turn 
generates a new fiducial median ^(v^x) relation, which 
can again be convolved with the scatter model to gener- 
ate new fiducial luminosities; after several iterations of 
this process, a stable relation for L(v%£ x ) is recovered if 
it exists, along with the distribution of halo with 
luminosity (i.e., P(v^\L)). 

Because no assumptions about the scatter model are 
made in this process, we may explore a variety of models. 
However, not all scatter models are "allowed" in the sense 
that the steep fall-off of the luminosity function imposes 
a stringent limit on the scatter for luminous galaxies. 
Even if the underlying relation for i(v^ ax ) imposed a 
sharp maximum limit on the luminosity, the addition of 
log-normal scatter would broaden the fall-off in the lu- 
minosity function; for DEEP2, this broadening becomes 
inconsistent with the observed fall-off for constant scat- 
ter models above 0.3 dex. In these circumstances, our 
implementation recovers a solution for L(v^ x ) which, 
when convolved with the scatter model, cannot match 
the luminosity function at the luminous end, and so only 
approximately matches the luminosity function at lower 
luminosities as well. 

FigurcQ]shows the comparison for absolute magnitudes 
Mb < — 19 and Mb < —20. Open data points denote 
the projected correlation functions we obtain from the 
abundance-matched Bolshoi simulation with different as- 
sumed values for the scatter (error bars have been sup- 
pressed for all but the zero-scatter model, to reduce clut- 
ter). Bl ack solid data po ints show the w p {r p ) measure- 
ments of lCoil et all (|006bD . The solid line shows the best- 
fitting power representation of the clustering obtained 
over all scales in that work, and the dashed and dotted 
lines show the best fits on scales larger and smaller than 1 
Mpc, respectively. The lower panels show the fractional 
difference between the best fit to the data (solid line) 
and the results of the abundance-matching exercise with 
different amounts of scatter. 

Substantial scatter appears to be needed to reproduce 
the DEEP2 clustering results to good accuracy: for scat- 
ter values below 0.4 dex, the abundance-matched Bolshoi 
catalog is significantly more clustered than DEEP2 for all 
threshold luminosities. This suggests that the results of 
iConrov. Wechsler. fc Kravtsovl (|2006[ ). who matched the 
DEEP2 clustering with a zero-scatter abundance match- 
ing model, were an accident of their having used a simu- 
lation whose background cosmology had a high value of 
as- For higher values of the scatter, we find that it is 
possible to reproduce the DEEP2 two-point function for 
a given threshold luminosity, with a scatter of 0.4 dex 
beign sufficient for Mb < —20 and 0.6 dex working well 
for Mb < —19 . but there is no fixed-scatter model that 
reproduces the iCoil et al~l (|006bf) results for all threshold 
luminosities. We note in passing that t his outcome is in 
qualit ative agreement with the study by I Wetzel fe W hite 
(2010), who found that a scatter of ~ 0.6 dex was re- 
quired to approximately match the DEEP2 clustering 
results with an abundance matching approach. Never- 
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Figure 1. Comparison of the projected autocorrelation functions, w p (r p ) in DEEP2 and in our abundance matching populations of the 
Bolshoi simulatio n with different amounts of scatter, for two different absolute magnitude thresholds. Black points show the DEEP2 
clustering results (Coil ct al. 006b), and the solid, dashed, and dotted lines show the best-fitting power laws to those data for all scales 
and scales greater than and less than 1 Mpc, respectively. The colored points denote the clustering measured in the Bolshoi simulation at 
z ~ 0.9 after abundance-matching to the DEEP2 B-band luminosity function, assuming various amounts of scatter in the L(v^ x ) relation, 
as shown in the legend. Error bars on the colored points have been suppressed for all but one of the scatter values to reduce clutter. There 
is little significant effect on clustering for scatter below 0.2 dex, and all such models are more strongly clustered than the DEEP2 data. 
Higher values of the scatter produce more significant impacts on the clustering, but no value of the scatter is consistent with the DEEP2 
data at all luminosity thresholds. 



theless, the lack of a single scatter value that consistently 
matches the DEEP2 clustering at all luminosity thresh- 
olds suggests that this fixed-scatter abundance matching 
approach may be too simplistic to account for the true 
galaxy population at high redshift, at least for samples 
selected in the B-band. 

In fact, the inconsistency is worse than it appears in 
Figure [TJ As mentioned above, our technique for adding 
scatter to the abundance matching approach does not 
necessarily guarantee that the output mock galaxy cata- 
log will match the desired luminosity function. Figure [2] 
shows the luminosity function we obtain from the model 
Bolshoi catalog for the same scatter models discussed 
above (colored points) , compared to the W06 luminosity 
function that we used as input to the abundance match- 
ing algorithm. The models with 0.2 dex scatter and be- 
low match the input luminosity function well, but models 
with higher scatter deviate significantly from the W06 
function at bright luminosities. The reason for the dis- 
crepancy is straightforward to understand. The output 
luminosity function from our abundance matching algo- 



rithm is effectively a convolution between the log-normal 
scatter and an implicit no-scatter luminosity function, 
which must necessarily be steeper at the bright end than 
the input DEEP2 luminosity function, which is itself al- 
ready quite steep. At some value for the scatter, the 
required no-scatter luminosity function will have an in- 
finite slope at the bright end, and for larger values of 
the scatter it will be impossible to reproduce the DEEP2 
luminosity function. 

Figure [T] therefore suggests that the maximum fixed 
log-normal scatter for abundance matching that is con- 
sistent with the W06 DEEP2 luminosity function is ~ 0.2 
dex in luminosity at fixed v^; x . Given that no such 
model is consistent with the DEEP2 correlation func- 
tion, we conclude that the simple fixed-scatter abun- 
dance matching technique for connecting galaxies with 
dark-matter halos is inconsistent with DEEP2 data in 
detail. It is worth noting that the DEEP2 luminosity 
function is measured in the rest-frame B band, which 
is sensitive to transient effects like starbursts and AGN, 
whereas the SDSS luminosity function that has been re- 
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Figure 2. A comparison of the input DEEP2 luminosity func- 
tion (solid line) to the actual luminosity functions achieved in the 
various scatter scenarios for abundance matching shown in Fig- 
ure [l] For scatter of < 0.2 dex in luminosity at fixed v^; x , the 
input luminosity function is reproduced accurately, but for larger 
values of the scatter, the output catalog contains too many bright 
L >* galaxies, indicating that such large values of the scatter in 
a fixed-scatter abudance matching approach are inconsistent with 
the DEEP2 luminosity function, given the Bolshoi cosmology. 

produced by fixed-scatter abundance matching is mea- 
sured in the r band, which should be less sensitive to 
such effects. In any case, the fact that the DEEP2 
luminosity-function discrepancy is limited to bright mag- 
nitudes, coupled with the varying amounts of scatter 
required to match the clustering for different luminos- 
ity thresholds, suggest that a variable-scatter abundance 
matching model whose scatter decreases as a function of 
v^f x might yield consistency will all of the DEEP2 mea- 
su rements. One such model ha s been recently described 
bv iTrujillo-Gomez et al.l ()2011h . which forces agreement 
with the input luminosity function at the expense of non- 
constant scatter. This model is not a unique solution, 
and the possible form and parameters of models with non 
constant scatter are unclear, so a detailed exploration of 
these issues is outside the scope of this work. 

In order to proceed, then, we must choose between an 
accurate luminosity function and an accurate two-point 
function. To guide our choice, we consider that our pri- 
mary goal in constructing mock catalogs is to reproduce 
the observational selection effects in the DEEP2 sample 
as accurately as possible. From that perspective, an inac- 
curate luminosity function is likely to be more damaging 
than an inaccurate two-point function, so we choose to 
prioritize the former over the latter. Only models with 
scatter below ~ 0.2 dex are then allowed. As shown in 
Figure [TJ scatter values in this range have little to no 
significant impact on the clustering, so in the interest 
of simplicity in our modeling, we choose to implement 
a zero-scatter abundance matching model to construct 
DEEP2 mock catalogs. 

Before we move on, however, it is worth exploring the 
impact that including scatter would have on the mocks 
we produce. Figure [3] shows the HODs that result from 
the various fixed-scatter abundance matching models we 
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Figure 3. The halo occupation distributions (HODs) that result 
from applying each of the scatter values for abundance matching 
in Figure [l] to the Bolshoi simulation. Solid lines show the mean 
number of galaxies per halo N(M), and dashed and dotted lines 
show the mean numbers of central and satellite galaxies, respec- 
tively. For scatter values larger than 0.2 dex in luminosity at fixed 
v max > the mean number of central galaxies remains below unity at 
all masses, which is radically inconsistent with current understand- 
ing of galaxy formation. This is a further indication that such large 
values of the scatter in abundance matching are problematic. 

considered in the previous two figures, for two different 
luminosity thresholds. Solid lines show the overall HOD 
N(M), dashed lines show the HOD for central galaxies, 
N C (M), and dotted lines show the HOD for satellites, 
N S (M). The first thing to note is that, for scatter val- 
ues above 0.2 dex, N c never reaches unity, implying that 
even some very massive halos do not host bright cen- 
tral galaxies in these models. This occurs because, as 
mentioned above, the no-scatter luminosity function has 
become infinitely steep at the bright end, which is equiv- 
alent to the mean £(v^ x ) relation's becoming flat at 
high v^f x . In that case, the large scatter will cause some 
central galaxies to scatter below the luminosity thresh- 
old of interest at all masses. This situation is strongly 
at odds with theories of galaxy formation, however and 
is another indication of the problematic nature of these 
high-scatter models. 

For the models with scatter of 0.2 dex and below, the 
HODs are more well behaved, and we note that the im- 
pact of the scatter occurs mainly at low masses, softening 
the cutoff in ./V but having little impact on the form of 
the HOD at large masses. We expect that a more accu- 
rate, variable-scatter abundance matching model would 
increase the scatter in luminosity at low subhalo masses, 
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while keeping the scatter small at high masses. This 
would be expected to further soften the low-mass cut- 
off, while having similarly little impact on the power-law 
part of the HOD at high masses. Thus, we anticipate that 
the galaxy mass-selection function we infer from our fi- 
nal mock catalogs in this work will be somewhat steeper 
in mass than occurs in the real universe. However, the 
mocks should prove reasonably accurate above that mass 
threshold, and in particular they should give an accu- 
rate representation of the galaxy occupation of massive 
groups and clusters, making them appropriate for use 
in optimizing cluster-detect ion algorithms. W e explore 
their use for that purpose in lGerke et al.l ((20121 ) . We also 
note in concluding this section that, should a more accu- 
rate, variable-scatter modeling approach be developed in 
the future, it could straightforwardly be combined with 
the techniques we develop below to assign galaxy colors 
and produce realistic DEEP2 mock catalogs that account 
for all observational selection effects. 

3.2.3. Populating the hghtcones with an evolving DEEP2 
luminosity function 

To produce the galaxy luminosities in our mock light- 
cones, then, we will abundance match the v^ x function 
of halos and subhalos to the DEEP2 luminosity func- 
tion, with zero scatter in the resulting L(v^ x ) rela- 
tion. This is complicated somewhat by the broad red- 
shift range covered by DEEP2, over which the luminos- 
ity function unde rgoes significant evolution (e.g., W06, 
iFaber et al.l 120071 ). Fortunately, this evolution is con- 
sistent with simple luminosity evolution of the global 
galaxy population, with no overall evolution in number 
density or luminosity-function shape. Therefore, we can 
reproduce the DEEP2 luminosity funciton by performing 
abundance matching with a single luminosity function, 
measured at a particular redshift zq in DEEP2, and then 
applying a redshift-dependent correction to the luminosi- 
ti es of the mo c k gala xies . 

IFaber et all (|2007[ ) found that the evolution of the 
overall DEEP2 luminosity function was reasonably well 
described by a pure linear brightening of Mg by 1.2 mag- 
nitudes per unit increase in redshift. However, redshift is 
not particularly well motivated physically as a parame- 
ter against which to measure galaxy evolution. We have 
thus recons i dered the measured M B data points from 
IFaber et al.l (pOOl as a function of the cosmic scale fac- 
tor a = 1/(1 + z). A linear fit to the scale factor yields a 
better description of the data than a linear fit to redshift; 
therefore, in constructing the DEEP2 mocks we will im- 
plement an evolving M B that dims by 2.45 magnitudes 
per unit increase in a. 

More exactly, we populate the mock DEEP2 light- 
con es using an evol ving luminosity function that takes 
the lSchechten (|1976t ) form. 

0(M)=O.41n(lO)0*lO a4(M * (Q) - M)(1 - Q) (1) 
xexp(-10°- 4M *( a ^ M ), 

with 

M*(a) = M*(a ) + Q a (a- a ) (2) 

and ciq = (l + zo) -1 . The parameters we use are listed in 
Table [2] They are based on the DEEP2 values measured 
at z ~ 0.9 in W06, with a few modifications. First, the 



Table 2 

Parameters of the evolving 
luminosity function used to 
produce the mock Hghtcones. 



Parameter Value 

M* -5 log /i a -20.8 

</>*^ 7.90 x 10~ 3 

a 1.30 

Qa 2.45 

20 0.90 



a AB magnitudes. 
° In comoving h 3 Mpc - 3 , 
for a ACDM cosmology with 
Cm = 0.27. This value is 
corrected to correspond to ac- 
count for the different volume- 
redshift relation in each mock 
cosmology. 

value for M B is shifted brighter by 0.13 magnitudes to 
be consistent with the best fit to the evolution of Mg(a). 
Second, because the measured value of <fr* at z = 0.9 
is the highest measured value in all of DEEP2, we in- 
stead take the mean of all values measured at z > 0.7 
in DEEP2, which lowers 4>* from the z = 0.9 value by 
about 12%. 

Finally, we also correct this value of (f>* to account for 
the different cosmological background models in each of 
our three N-body simulations. W06 assumed a ACDM 
cosmology with VLm = 0.3 in their measurements, using 
a volume element computed in that cosmology to con- 
vert the redshift-space galaxy counts to a number den- 
sity. Since they would have inferred a different number 
density had they assumed the cosmology in each of our 
simulation boxes, we must correct the measured value of 
4>* by a factor Vq.s/Vq m , where the volume is given in 
terms of the comoving line-of-sight distance r(z) as 

f 10 dr 
Vn M =4n r 2 (z)—dz, (3) 

JO.8 dz 

Vo.3 is the value for D,m = 0.3, and the integral is taken 
over the redshift range considered for the z ~ 0.9 mea- 
surement of the DEEP2 luminosity function in W06. The 
comoving distance is given by the usual ACDM equation 



dz H ^n A + n M (i + z) 3 ' 

The procedure we have outlined here ensures, by con- 
struction, that the luminosity function of our DEEP2 
mock catalogs will match the input luminosity function 
to within Poisson noise. Figure @] shows the compar- 
ison in bins of redshift. Data points are the measured 
luminosity function for all 40 of the Bolshoi hghtcones, 
solid lines are the curves corresponding to the measured 
parameters from DEEP2 in W06, and dotted lines are 
curves corresponding to Equation [2] for the parameters 
given in Table [5] The dropoff in the low-luminosity dat- 
apoints at high redshift occurs because of an evolving 
luminosity cut we apply to the catalog, which excludes 
galaxies that are obviously below the DEEP2 apparent 
magnitude limit, to keep the catalog sizes manageable. 
Apart from this, the agreement between the luminosity 
function in the mocks and the input model is perfect, as 
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Figure 4. Comparison of the DEEP2 luminosity function to the 
luminosity function in the mock catalogs. Solid lines are the 
DEEP2 luminosity function as measured in W06. Dotted lines 
are the evolving luminosity function as parameterized in Table [2] 
which was used as an input to the mock-catalog creation algo- 
rithm. Data points are the measured number density of galaxies 
in the mock, in bins of luminosity, with Poisson error bars. 

expected. 

3.3. Color assignment by environment matching 

Accurately reproducing the luminosity function, 
though necessary, is not sufficient to fully model the 
DEEP 2 galaxy se lection funct i on. A s discussed at length 
in, e.g., W06 and lGerke et~aT1 (l2007h . DEEP2's R = 24.1 
apparent magnitude limit translates into a -B-band limit 
at z ~ 0.75 and a £/-band limit at z > 1. This means 
that, at a given luminosity, red galaxies will drop out of 
the DEEP2 sample at a lower redshift than blue galaxies. 
There is a well-known strong correlation between galaxy 
color an d galaxy environ ment, i.e., local galaxy den- 
sity {e.g. Hog g et aT1l2004h. and this relation persists to 
z > 1 in DEEP2 (jCooper et al.ll2006[) although it evolves 
stron gly with redshift (jCooper et al.1 120071 I Gerke et al.1 
120071) . If we are to use our mock catalogs for applica- 
tions that probe the galaxy density field, such as testing 
and calibrating cluster-finding algorithms, it is impor- 
tant to include the effects of color-dependent selection 
on the spatial sampling of galaxies. Put differently, wc 
would like to ensure that our mocks have the right HOD 
for galaxies of a particular color. This amounts to repro- 
ducing the galaxy cluste ring as a function of color that 
is observed in DEEP 2 bv lCoil et all (|2008( ). As those au- 
thors also showed (see their Figure 13), this will require 
t hat we reproduce the co lor-en vironment relation. 

IKitzbichler fc White! (pOOl used a semi-analytic 
model for ga laxy formation (fromlCroton et al.ll2006l as 
updated by iDe Lucia fc Blaizoll I2007T) . in an effort to 
reproduce such effects in their mock catalogs. Unfor- 
tunately, this model does not accurately reproduce the 
color-magnitude-environment relation that is present in 
DEEP2 at z ~ 1; in particular, it produces too few faint, 
red galaxies in dense regions. This is a common feature 
of most semi-analytic models at present, and it likely to 
be a particular problem for using these mocks to test 
group-finding algorithms, since the number of observed 



galaxies in real groups will be lower than is predicted by 
the semi-analytic model. 

Fortunately, the existence of an empirical color-density 
relation for galaxies also suggests a purely empirical 
means for reproducing it in mock catalogs, if our goal 
is simply to reproduce the features of the data, with- 
out necessarily understanding the astrophysical processes 
necessary to produce them in detail. Just as we were 
able to populate the N-body simulation with galaxy lu- 
minosities by making use of the known relation between 
luminosity and halo mass, we can add galaxy colors using 
the relation between color and local galaxy density. This 
will be slightly more complicated than the approach we 
used to add luminosities, since in that case we were able 
to posit a very tight (indeed, zero-scatter) relation be- 
tween mass and luminosity, whereas the measured color- 
density relation in D EEP2 has an extremely large scatter 
(jCooper et al.l l2006). within which the galaxy colors take 
their usual bi-modal distribution between red and blue 
objects. Furthermore, there is also a strong correlation 
between galaxy color and luminosity (and thus presum- 
ably also between color and masf3) via the well-known 
red and blue sequences, and each of these correlations 
may be evolving with redshift. 

To ensure that we accurately capture the correlations 
between color, luminosity, and galaxy environment, and 
the scatter and evolution in these relations, we use a 
proceure that can be briefly outlined as follows. First, 
we divide both the DEEP2 dataset and the mock cat- 
alog into bins of redshift, luminosity, and local galaxy 
density. Within each bin, we select galaxies at random 
from DEEP2 (with replacement, and after applying a 
weighting to account for the color-dependent DEEP2 
redshift- failure rate). For each galaxy thus selected, 
we assign its fc-corrected U — B color (as computed 
in W06) to one of the galaxies from the same bin in 
the mock. We explain each step in this procedure in 
more detail below. Our approach automatically repro- 
duces the measured color distribution from DEEP2 in 
each bin and therefore also the evolving color-luminosity- 
density relation and its scatter. It is inspired by the 
ADDGALS algorithm (Wechsler et al. in preparation) 
that was used to produce mock catalogs for use with 
the Sloan Dig i tal Sky Survey cluster-finding efforts (e.g. 
i Koester et all I2007U Uohnston et~aT1 120071 : IRozo et al.1 
12007ft and several other purposes, inc luding testing pho- 
tometric redshifts dGerdes et al.ll2010l) and spectroscopic 
followup (jCunha et al.ll2012t) . Our algorithm differs from 
ADDGALS, however, in its use of subhalo abundance 
matching to assign luminosities and in various details of 
the color-assignment algorithm — e.g., the use of broad- 
band colors, rather than SEDs — that make it specific to 
DEEP2 mock-making. 

3.3.1. Choosing an environment measure 

We have not yet specified what metric we will 
use to measure local galaxy density in this process. 

10 Indeed, there is much debate in the literature at present re- 
garding the relative importance of halo mass and local environment 
in determining galaxy color, with a large contin gent arguing tha t 
mass is the more fundamental parameter (e.g., Woo ct al. 2012). 
For our purposes, though, it is sufficent to note that these cor- 
relations exist in the data, whatever their underlying cause, and 
attempt to reproduce them in the mocks. 
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ICooper et al.l (|2005l) tested a number of different possible 
density measures in the YWC catalogs and found that 
the distance to the nth-nearest neighbor reproduced the 
underlying galaxy density field with the best combina- 
tion of fidelity and dynamic range. The basic strategy is 
to compute the projected distance r n to the nth-nearest 
neighbor within a window of ±1500 km s in the redshift 
direction. This can then be converted into a surface- 
density measure, 

£„ = n/(nr 2 n ). (5) 

Because the mean density of a magnitude-limited galaxy 
sample falls with redshift, it is also important to nor- 
malize the density to account for this; we can do this by 
converting the density values to over densities, 

S n = E„/(£ n (z)>, (6) 

where the quantity in the denominator is the smoothed 
mean density computed in rolling bins of redshift. This 
measure is convenient for our purposes since it is the one 
that was used to measure the color-environment relatio n 
and its evolution in DEEP2 (jCooper et al.l 120061 l2007f ). 
In this work we will follow those studies and measure 
local galaxy density in DEEP2 by computing the third- 
nearest ne ighbor overde n sity S 3, since n = 3 was the 
value that ICooper et al.l (|2005l ) identified as giving the 
most accurate measure of local density in DEEP2. 

Because the DEEP2 targeting algorithm can only 
schedule ~ 70% of appropriate targets for spectroscopy, 
and because the redshift-success rate is ~ 70%, the 
overall sampling rate of the magnitude-limited pool of 
DEEP2 targets is < 50%. When estimating the local en- 
vironment in the mocks for color assignment, we are con- 
sidering a complete, volume-limited sample, so the third- 
nearest-neighbor distance will probe the density field on 
smaller scales in the mocks than it will in DEEP2. It 
will be preferable to choose a density estimator in the 
mocks that is comparable to the S3 values that would be 
measured after accounting for all observational effects. 
To find the appropriate estimator, we computed various 
nth-nearest-neighbor overdensities within an early ver- 
sion of these mock catalogs. We then applied the DEEP2 
targeting algorithm and a simple redshift-failure rate (see 
the following sections for details), and we computed the 
DEEP2 estimator £3 for the resulting "observed" catalog. 
A comparison of the various density measures showed 
that the projected seveni/i-nearest-neighbor overdensity 
in the volume-limited mock 67 was most tightly corre- 
lated with the as-observed S3 values. Therefore we will 
use ^7 as our density estimator for color assignment in 
the mocks. 

A possible complication is that, even if $3 and Sj are 
strongly correlated, their absolute values may not be di- 
rectly comparable. It is not guaranteed that the correla- 
tion will lie along the line of equality on a plot of S3 ver- 
sus 67. One might suppose, for example, that the maxi- 
mum overdensity in the sparsely sampled DEEP2 catalog 
might have a lower numerical value than the maximum 
in the volume-limited catalog. Thus, simply binning the 
data in £3 and the mock in £7 may not give compara- 
ble samples in corresponding bins. To deal with this, 
we divide each catalog into quintiles of local density and 
assume that corresponding quintiles of measured over- 



density contain galaxies with comparable physical local 
densities. For example, the most overdense 20% of mock 
galaxies receive colors drawn from the most overdense 
20% of DEEP2 systems, and so on. 

A final concern for computing projected nearest- 
neighbor distances is edge effects. If a galaxy lies close 
enough to the edge of a field that its true nth-nearest 
neighbor lies outside the field, then the measured S n 
for that galaxy will be misestimated. This problem can 
be handled straightforwardly by imposing buffer regions 
near the field edges. In the DEEP2 data, we exclude 
from our color-assignment algorithm all galaxies that lie 
within 1 comoving h~ x Mpc from any edge or gap in the 
observed region. This ensures that only galaxies with 
accurately measured $3 values are used to assign colors. 
In the mocks, we would like to assign colors to all galax- 
ies within the observed region, so we cannot perform the 
same exclusion. Instead, we construct our mock light- 
cones to have an angular extent that is larger than a 
DEEP2 observed field by 0.2 degrees in either direction. 
This creates an 0.1 degree buffer all the way around the 
"observed" region of the mock, so that galaxies in that 
area will have accurately measured values of #7. 

3.3.2. Adding colors to the mock galaxies 

Having specified our mapping between overdensity 
measured in DEEP2 and in the complete, volume-limited 
mock, we can proceed to add colors to the mock galax- 
ies. Figure [5] gives a schematic picture of our color- 
assignment technique. Within a single bin of redshift 
and MJB, we divide the DEEP2 sample into quin- 
tiles of $3 and the mock into quintiles of Sj . We then 
draw galaxy colors randomly from the galaxies in each 
DEEP2 quintile in turn, and we assign them to the galax- 
ies in the corresponding mock quintile until all mock 
galaxies have been assigned colors. Because the DEEP2 
redshift-success rate depends on color, before perform- 
ing the random draw, we also weight the data by the 
incompleteness-correction weights computed in W06. (In 
the EGS field, we also divide out the color-dependent 
weighting that was applied in the DEEP2 target-selection 
algorithm for this field, to ensure that we have an unbi- 
ased color distribution.) 

This approach can be applied straightforwardly in bins 
of redshift and Mb that are completely sampled by the 
DEEP2 survey. Within each redshift bin, however, there 
is a color-dependent threshold luminosity, below which 
DEEP2 constitutes a partially incomplete sample and 
a minimum luminosity below which no DEEP2 galax- 
ies are observed. Examples of these limits are shown in 
Figure [6] Because the DEEP2 i?-band selection is per- 
fomed in increasingly blue rest-frame bands as redshift 
increases, the bias against faint red galaxies also wors- 
ens. To determine whether we are working in a partially 
incomplete bin while assigning colors, in each bin we first 
perform a test color assignment on a fraction of the mock 
galaxies. We then compute i?-band apparent magnitudes 
for this sample, as described in Section 13.3.31 below, and 
check for values that fall below the DEEP2 magnitude 
limit, R = 24.1. Assigning colors to mock galaxies in 

11 A detail: our magnitude binning is performed after subtract- 
ing off the evolution of Mg with scale factor a; that is, we construct 
bins of L/L* . 
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Figure 5. Schematic representation of our basic algorithm for 
adding galaxy colors to the volume-limited mock catalogs. Within 
a bin of redshift and absolute magnitude (0.8 < z < 0.9 and —21 < 
Mg — 5 log h < — 20 is shown here) , we divide the DEEP2 data into 
five bins of local density $3, containing equal number of galaxies. 
These bins are denoted by gray lines and numbers in the top panel. 
We divide the mock galaxies similarly into quintiles of St (bottom 
panel). We then assign rest-frame U — B colors to the mock galaxies 
in each density quintile according to the color distribution in each of 
the corresponding DEEP2 density quintiles, after weighting these 
distributions to account for the color-dependent redshift failure 
rate (middle panels). 

such bins will be more complicated than just described. 
To make this process easier, we work through the red- 
shift and luminosity bins in increasing order of redshift 
and decreasing order of luminosity, to ensure that, for 
each partially incomplete bin, there is a nearby complete 
bin at lower redshift or brighter luminosity that has al- 
ready been populated with mock colors. 

In partially incomplete bins, it will be necessary to 
exclude some galaxies from the mock catalog by assign- 
ing them rest-frame colors that will put them below the 
DEEP2 apparent magnitude limit. We have no direct in- 
formation about the density distribution of galaxies be- 
low the magnitude limit, but because the incompleteness 
depends on color, it is likely that it also has some cor- 
relation with galaxy environment. Our task, then, is to 
determine (a) how many galaxies to exclude, (b) with 
what distribution in local density, and (c) what colors to 
assign them so that they drop out of the mock DEEP2 
survey. 

The first step is relatively simple. To get a rough es- 
timate of the number of mock galaxies to discard, we 
can simply compare the number density, n moc k, of mock 
galaxies in the redshift-luminosity bin in question and 
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Figure 6. An illustration of redshift and color-dependent incom- 
pleteness in DEEP2. The panels show rest-frame color-magnitude 
diagrams for two different redshift bins, as computed using the 
fc-corrections of W06. The sharp, tilted cutoff at the faint end 
corresponds to the DEEP2 R = 24.1 magnitude limit at the lower 
redshift limit of each bin. Vertical gray lines in each panel show the 
point at which the galaxy sample in each bin becomes partially in- 
complete (i.e., some red galaxies start to drop below the magnitude 
limit at the highest redshifts in the bin) and the point at which all 
galaxies are lost from the sample. (The very faint outlier in the 
upper panel is an example of a galaxy with an incorrectly identified 
redshift. Such objects are rare and are excluded by construction 
from our color-assignment algorithm.) 

compare it to the number density of DEEP2 galaxies 
in the same bin, nDEEP2, after weighting the DEEP2 
galaxies with the W06 incompleteness-correction weights 
(which account for targeting incompleteness and redshift 
failures, but not incompleteness owing to the apparent 
magnitude limit). However, one expects some field-to- 
field variance in the mock owing to sample variance and 
shot noise, and we would obviously like this variance 
to be reflected in the mocks. We can straightforwardly 
compute the impact of this scatter on a particular mock 
lightcone by integrating the input luminosity function 
(see Equatior(2] and Table [2]) over the luminosity range 
of the bin in question and comparing the result to the 
actual number density in the mock lightcone being con- 
sidered. This gives the size of the deviation from the 
mean DEEP2 number density in that particular redshift 
bin of that particular lightcone. Then the target number 
density of observed mock objects in that bin is given by 



^targ 



™DEEP2™mock 



(7) 



where L mm and L max are the upper and lower limits of 
the luminosity bin being considered. Having computed 
this target number density, we can proceed to exclude 
galaxies from the mock until ntarg is reached. 

Which galaxies should we exclude? It is extremely 
likely that the incompleteness in DEEP2 is a function 
of local environment, since it depends strongly on color, 
and it is important that we ensure that the overdensity 
distribution in the mocks is comparable to the one in 
the data before we perform color assignment. Since we 
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Figure 7. A schematic diagram showing our technique (described 
in detail in the text) for rejecting galaxies in a local-density- 
dependent way from bins of luminosity and redshift in which the 
DEEP2 sample is incomplete. 

cannot directly measure the overdensity distribution of 
unobserved DEEP2 galaxies, we will need to find some 
means of inferring it. Our approach is to make compar- 
isons to the nearest complete bin at the same luminosity 
but lower redshift. The method is illustrated schemati- 
cally in Figure [Jj We start by dividing this lower redshift 
bin into quintiles of overdensity, both in the data and in 
the mock. Then, holding the overdensity bins fixed, we 
divide our incomplete bin into the same five bins of over- 
density, in both the mock and the data. This is shown 
in the left-hand panels of the Figure, where the low and 
high-redshift bins are denoted by solid and dashed lines, 
respectively . 

We then compare the ratios of the total number counts 
in each of the five overdensity bins (right-hand panel of 
the Figure). It is to be expected that the distribution of 
overdensity values in these bins will evolve in the mock 
catalog, owing to the evolution of cosmic structure (green 
curve). Because of density-dependent incompleteness, we 
expect that the distribution will appear to evolve differ- 
ently in the data (violet curve). By taking the ratio of 
the relative change in the DEEP 2 distribution to that 
in the mock (black curve), we can compute the number 
density of galaxies that ought to remain in the ith bin of 
overdensity after accounting for incompleteness: 

n targ( Z ) _ 7 4)EEP2( Z ) ( "mock( Z ) \ "targ 
"mock(^) n DEEP 2 ( 2; 0) V^mockM/ ™DEEP2 

where zq is the redshift of the nearest complete bin at this 
luminosity, and the final factor accounts for the expected 
sample variance in the mock lightcone in question (see 
Equation [Jj . 

Having set these targets for the number denisity of 
mock galaxies in each bin of environment, we proceed 
to exclude galaxies from the sample as follows. Using 
the overdensity binning that we established in the lower- 
redshift complete bin at Zq, we assign provisional colors 
to the mock galaxies as described above, using the lower- 
redshift complete DEEP2 sample as the input sample. 
We then convert the mock galaxies colors and absolute 
magnitudes to apparent i?-band magnitudes as described 
below. Since the bin we are considering is incomplete, 
some of these objects will have R > 24.1 and hence will 
drop out of the observed sample in the mock. For these 



galaxies, we retain the colors we just assignecQ and ex- 
clude them from the sample; we erase the colors for the 
rest. We then repeat this procedure on the remaining 
galaxies until we have excluded enough galaxies to reach 
n^ arg in this bin. 

For the remaining mock galaxies in this bin, we then 
assign colors exactly as we did in the complete bins, by 
binning both the mock galaxies and the DEEP2 galaxies 
at the current redshift into quintiles of overdensity, then 
drawing DEEP2 colors at random from within each quin- 
tile to assign to the mock galaxies in the corresponding 
density bin. There is one small complication remaining, 
however. This bin has an incomplete region in color- 
magnitude space, and it is possible that some of the color 
assignments in the remaining mock galaxies will fall in 
this region. To guard against this, we test our color as- 
signments by computing apparent i?-band magnitudes 
as described below. For any galaxies with R > 24.1, 
we erase their colors and draw new ones, repeating this 
procedure until all the remaining mock galaxies have col- 
ors that place them above the DEEP2 magnitude limit. 
This completes the task of assigning colors to the mock 
in partially incomplete bins of redshift and luminosity. 

For the faintest luminosity bins, where DEEP2 data 
is entirely lacking, or where there is no complete bin 
at lower redshift, we simply extrapolate from the color- 
density relation we used to populate the next-brightest 
bin, with one modification. Because galaxy luminosity 
and color are correlated, we shift the colors that we assign 
to the faint mock galaxies according to the mean relation, 
treating red galaxies and blue galaxies separately. First, 
we divide the DEEP2 sample into red and blue galaxies 
according to the division used by W06. We then fit for 
the mean linear color-magnitude relation of each of these 
samples seperately, finding that red galaxies redden by 
0.03 magnitudes per unit increase in absolute magnitude, 
while blue galaxies redden with a slope of 0.1 magnitudes. 
We use these relations to shift the assigned colors blue- 
ward, according to the difference in magnitude between 
the bright input galaxies and the faint mock galaxies that 
are having colors assigned. In this case, the input galax- 
ies we use to assign colors are not DEEP 2 galaxies but 
rather the galaxies in the next-brightest bin of the mock, 
since the DEEP 2 sample will be incomplete, but this 
has been accounted for in the mock, as described above. 
(Thus, in this case, we use 87 as the density measure for 
both samples, but the algorithm is otherwise identical.) 
The color-density relation this produces will almost cer- 
tainly be incorrect, but this is irrelevant for any practical 
use of these mocks, and in any case the situation cannot 
be improved without more data. 

3.3.3. Computing apparent magnitudes 

Once we have assigned rest-frame B-band luminosities 
and U — B colors to the mock galaxies, we can convert 
these to apparent i?-band magnitudes wir by empirically 
inverting the ^-correction procedure that was used to 
compute the DEEP2 rest-frame values in W06. The rest- 
frame values are related to the observed magnitude by 

m R = M B + D{z) - k, (9) 

12 If the number of galaxies with R > 24.1 is larger than the 
number needed to reach fij arg , then we retain the colors for only 
faintest ones up to this number. 
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where k, the fc-correction, is a function of U — B and z; 
and D(z) is the distance modulus. To obtain the appro- 
priate A;-correction for our mock galaxies, we infer it from 
the values computed for the DEEP2 sample as follows. 

First, we divide the DEEP 2 sample into narrow bins 
of width 0.01 in redshift. Within each of these bins, we 
fit a third-order polynomial to the relation between the 
fc-corrections and the rest-frame U — B colors that were 
computed in W06. As shown in the Appendix of that 
paper, k(U — B) is a smooth, nearly linear function at 
fixed redshift, so our third-order polynomial fit will cap- 
ture the relation well, and extrapolating a small distance 
into incomplete regions of color space will not be a prob- 
lem. We use the fit thus obtained to compute tur for 
each mock galaxy according to Equ ation HI Thi s tech - 
nique is identical to the one used in iGerke et al.l ()2007f ) 
to compute observed magnitudes in the YWC mocks af- 
ter assigning rest-frame colors in a manner similar to the 
one described above. 

4. SIMULATING DEEP2 OBSERVATIONS 

The mock DEEP2 lightcones we constructed in the 
previous section should fully capture the impact of the 
DEEP2 R = 24.1, but to fully characterize the DEEP2 
selection it will also be necessary to account for the other 
observational effects present in the survey. These effects 
comprise the DEIMOS slitmask-making algorithm that 
schedules galaxies for spectroscopic observation; contam- 
ination by foreground stars and background galaxies; and 
the magnitude and color-dependent failure rate for ob- 
taining reliable redshifts. We describe our methods for 
modeling each of these effects in turn below. 

4.1. Mask-making: The DEEP2 targeting algorithm 

DEIMOS is a multiplexed slit spectrometer, in which a 
custom-made mask is placed in the focal plane, with a slit 
cut in the position of each object that has been scheduled 
for observation. A number of practical constraints con- 
trol the number of objects that can be observed in this 
manner. First, since the light is dispersed along a partic- 
ular direction within the spectrograph, the slits must be 
oriented roughly parallel to an axis perpendicular to the 
spectral direction (within DEEP2 the slits are allowed to 
be tilted by up to 30 degrees from this axis). To avoid 
overlapping spectra on the detector, slits also may not 
overlap along this spatial axis. To allow for sufficient 
sampling of night-sky emission for sky subtraction, the 
slits must be long enough to allow empty regions of sky 
to be observed on cither side of the target galaxies; this 
limits slit lengths to be no shorter than three arcseconds 
in DEEP2. 

These limitations mean that galaxies in crowded re- 
gions on the sky will be less likely than average to be 
assigned to a slit on a given mask. To mitigate the ef- 
fects of slit-crowding, DEEP2 tiled the survey region with 
slitmasks in an overlapping pattern, so that nearly all 
galaxies have two or more opportunities to be assigned 
to a slitmast F^l. The DEEP2 slitmask cre a tion a lgorithm 
is outlined in iDavis. Gerke. fc Newman! (120041 ) and are 
described in detail bv lNewman et al.l ( 20121) . 

The mock DEEP2 lightcones we described above are 
constructed to have an identical geometry to the three 

13 As discussed below, this increases to four chances in the EGS. 



main DEEP2 fields, so the DEEP2 slitmask-making al- 
gorithm can be applied directly to the mocks to select 
mock galaxies for spectroscopic "observation" (though 
the mock lightcones must first be divided into three sub- 
cones, corresponding to the three photometric pointings 
making up each DEEP 2 field). 

The remaining complication is the pre-selection that 
is performed in observed BRI color-color space in the 
DEEP2 photometry. Since we have only produced ob- 
served R band apparent magnitudes, we cannot directly 
apply this selection to the mock catalog. The color-color 
pre-selection is tuned to select a nearly complete set of 
galaxies at z > 0.75, while exclu ding nearly all galaxies 
at z < 0.7 (|Newman et al.l l2012). so it should be possi- 
ble to approximate it as a redshift-dependent selection 
weight. DEEP2 galaxies in the EGS were selected with- 
out this pre-selection, so we can use this field to test the 
efficacy of the color selection and convert it into a redshft 
selection. The EGS sample confirms that the DEEP2 
color cuts are a highly efficient means of selecting high- 
rcdshift galaxies: only a few percent of z > 0.75 galaxies 
in the EGS are excluded by the cuts (and many of these 
have large photometric errors), while a large fraction of 
lower redshift objects are excluded. Apart from a small 
residual population of foreground galaxies (which we deal 
with below), the DEEP2 color pre-selection can be ac- 
curately approximated as a redshift-dependent selection 
probability as follows: 



Psel(z) = fl 



i e -43.7(z-0.725) 



(10) 



We apply this selection probability to the mock galax- 
ies before passing them through the DEEP2 slitmask 
making algorithm. We also exclude all galaxies with 
z > 1.4 in this selection step, since the [Oil] AA3727 dou- 
blet leaves the DEEP2 spectral window at this redshift, 
so the redshift success rate of DEEP2 galaxies drops dra- 
matically at higher z. We will account for these higher 
redshift objects, as well as foreground objects and stars, 
below. This accurately reproduces the redshift distri- 
bution of DEEP2 in the range 0.7 < 0.7 < 1.4, but it 
does not correctly account for the color dependence of 
the selection near the redshift cutoff. For this reason, 
the colors of "observed" galaxies in the main (non-EGS) 
DEEP2 mock catalogs presented here should be treated 
with caution at z < 0.75. 

4.1.1. Foreground and background objects 

The redshift selection function described above accu- 
rately captures the DEEP2 selection in the primary red- 
shift range 0.75 < z < 1.4. However, it does not ac- 
count for the small but significant fraction of DEEP2 
targets that do not lie in this range. These include ob- 
jects in three categories: foreground (z < 0.75) galax- 
ies that nevertheless pass the DEEP2 color pre-selection, 
foreground stars that have been misclassified as galax- 
ies in the DEEP2 photometry, and background objects 
at z > 1.4. Since objects in each of these categories 
use up slit length that could otherwise be assigned to 
galaxies in the primary DEEP2 range, it is important to 
account for their effects on the DEEP2 slitmask-making 
algorithm. Since each of these samples is highly incom- 
plete with complicated selection, it is nearly impossible 
to accurately reproduce the properties (e.g., colors) of 
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the objects in each set. Since such objects are not typ- 
ically the focus of scientific analysis, and since we only 
wish to include their impact on the spectroscopic target- 
ing, it is reasonable to include them in a random way as 
described below. 

The sample of DEEP2 targets with reliable redshifts 
in the three non-EGS fields consists of roughly 1% stars 
and 4% galaxies at z < 0.75. Follow-up of the DEEP2 
rcdshift failures with UV spectroscopy shows that an ad- 
ditional ~ 15% of DEEP2 targets lie at z > 1.4 (C. Stci- 
del, private communication). To contaminate the sample 
with stars before applying the maskmaking algorithm, we 
draw i?-band apparent magnitude values from a Gaus- 
sian distribution with mean 21.9 and dispersion of 1.5 
magnitudes. We draw from this distribution and create 
potential spectroscopic targets ("stars") at redshift zero 
in sufficient numbers to constitute 1% of the targets. To 
produce foreground and background galaxies, we select 
galaxies at random from the populations at z < 0.75 
and z > 1.4 in appropriate proportions to match what is 
found in DEEP2, and we add these to the pool of poten- 
tial spectroscopic targets. Once these contaminants are 
added to the mock catalog, we execute the DEEP2 slit- 
mask making procedure on this sample, producing a list 
of objects that would be scheduled for DEIMOS observa- 
tion in a real survey. This constitutes our mock DEEP2 
spectroscopic sample. 

4.2. Simulating redshift failure 

Approximately 30% of DEEP2 spectroscopic targets 
do not yield good redshifts (|Newman et al.ll2012l) . As we 
mentioned above, roughly half of these lie in the so-called 
redshift desert at z > 1.4, where no strong galactic spec- 
tral features fall in the optical wavelength range. The 
remaining 15% fail to yield redshifts for a variety of rea- 
sons, most of which lead the observed spectrum to have 
a low signal to noise ratio. This redshift failure rate has 
some dependence on apparent magnitude and color: for 
example, faint red galaxies combine low flux with weak 
spectral absorption features, making them particularly 
susceptible to rcdshift failure. If we wish our mock cata- 
logs to accurately model the selection function of galax- 
ies with DEEP2 redshifts, it is important that we include 
color and magnitude dependence of redshift failure in our 
catalogs. 

W06 developed a weighting scheme that accounts 
for color and magnitude-dependent incompleteness in 
DEEP2 by binning the spectroscopic targets in observed 
color-color-magnitude space and comparing the number 
of successful redshifts in each bin to the total number of 
photometric objects (making some justified assumptions 
about the redshift distribution of the spectroscopic tar- 
gets that failed to yield a redshift). These weights can 
be straightforwardly inverted to yield an estimate of the 
redshift failure probability for a galaxy in a particular re- 
gion of photometric space. To apply this probability to 
the mocks, when we assign rest-frame (U — B) colors to 
the mock galaxies by randomly drawing DEEP2 objects 
from within a bin of density, we also assign each DEEP2 
galaxy's incompleteness weight ui corr to the mock galaxy 
that receives its color. 

We use the "optimal" set of weights from W06 for 
this purpose, with two small modifications. The weights 
account for incompleteness owing both to redshift fail- 



ure and for targeting incompleteness (i.e., the failure 
to schedule all suitable photometric objects for spec- 
troscopy). Since we have already accounted for the latter 
type of incompleteness in the mock by running it through 
the DEEP2 slitmask-making algorithm, we must correct 
for this to avoid double-counting the targeting incom- 
pleteness. The slitmask-making algorithm does not de- 
pend on color or apparent magnitude, so the correction is 
simple: we need only multiply each weight by the fraction 
of potential targets that are scheduled for spectroscopic 
observation. (For galaxies in the EGS, it is also necessary 
to multiply out the redshift-dependent selection weight- 
ing described in the next section, which is straightfor- 
ward.) In addition, there is a sharp drop in redshift suc- 
cess for the faintest 0.5 magnitudes of the DEEP2 sample 
(i.e., galaxies with 23.6 < R < 24.1). This is accounted 
for in the weighting scheme, but there is no guarantee 
that a mock galaxy will have exactly the same apparent 
R magnitude as the DEEP2 galaxy we used to assign 
its weight. Hence, if either the source DEEP 2 galaxy or 
the target mock galaxy has R > 23.6, we apply a lin- 
ear correction to the mock galaxy's weight with a slope 
of 0.13 per unit magnitude difference between the target 
and source galaxy (applying a minimum of R = 23.6 on 
both galaxy magnitudes when computing the difference). 

Each galaxy in the mock can then be assigned a prob- 
ability of yielding a successful redshift, p z — 10^*1-1 where 
u> CO ri is the W06 incompleteness weight, corrected for the 
targeting efficiency. We then use these probabilities to 
select stochastically a subsample of the spectroscopic tar- 
gets that we declare to have "failed" redshifts, until the 
fraction of redshift failures (including both these stochas- 
tic failures and background objects at z > 1.4) matches 
the value in DEEP2. We will refer to the remaining tar- 
gets as the "observed" mock galaxies. With this step, 
we have fully accounted for the color and magnitude- 
dependent DEEP2 selection effects in the mock catalogs, 
to the extent that it is possible to do so. 

4.3. The Extended Groth Strip 

In the EGS, spectroscopic targets are drawn from all 
regions of color-color space; no pre-selection is applied to 
exclude low-redshift galaxies. (However, as described be- 
low, the selection is weighted in color-color space, to stop 
the sample from being dominated by the more numerous 
low-z galaxies.) A sample that is magnitude- limited at 
R = 24.1 and extends to low redshift will include many 
nearby galaxies with very low-luminosity — and hence low 
mass. According to the W06 ^-corrections, galaxies in 
the EGS have absolute magnitudes as low as Mb ~ —11, 
similar to the Leo dwarfs; the abundance-matching pre- 
scription we developed in Section 13.21 would place such 
galaxies in halos with masses below 1O 1O M0. To include 
such low-mass halos in N-body models with cosmolog- 
ically interesting volumes requires very high-resolution 
simulations, such as Bolshoi, which have only been com- 
putationally feasible within the last few years. The use 
of such simulations is one of the primary improvements 
of this work over the earlier mock catalogs of YWC04; it 
allows us to construct the first realistic mock catalog that 
includes the full range of redshift and luminosity probed 
by the EGS spectroscopic dataset. 

When constructing the DEEP2 mock catalogs, we 
made use of data covering the full redshift range of EGS, 
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< z < 1.4, making use only of data from the EGS at 
z < 0.75, where the primary DEEP2 spectroscopic sam- 
ple is highly incomplete. Thus, our mock galaxies cover 
the full range of photometric and redshift space that is 
probed by DEEP2 spectroscopy in the EGS. However, 
there remain differences between the EGS dataset and 
the rest of DEEP2 that we must account for in con- 
structing an EGS mock. The first is the field geome- 
try and orientation. The three non-EGS DEEP2 fields 
are 0.5° x 2° rectangular fields with their long axes ori- 
ented along lines of constant declination. The EGS, by 
contrast, is half as wide, at 0.25° x 2°, and is oriented 
perpendicular to the ecliptic, to allow easier access by 
space-based instruments. To account for these differ- 
ences when construcing EGS mocks, we rotate the co- 
ordinate system of each of our mock lightcones to match 
the orientation of EGS and then excise the central 0.25° 
strip to use in EGS-like maskmaking. It is important to 
note that this approach means that our EGS lightcones 
are not independent from our main DEEP2 mocks; in- 
stead, each EGS cone corresponds exactly to one of the 
primary DEEP2 mocks. 

The remaining differences between the EGS and the 
rest of DEEP2 involve the details of spectroscopic tar- 
get selection. As we mentioned above, spectroscopic 
targets in the EGS are chosen from all regions of pho- 
tometric color space, in contrast to the main DEEP2 
sample, where a color pre-selection is applied. However, 
since a simple, magnitude-limited survey will always be 
dominated by the numerous fainter objects at low red- 
shift, and since DEEP2's primary goal is to explore the 
z ~ 1 universe, the EGS spectroscopic target selection 
is weighted in color space, so as to preferentially select 
high-redshift objects. The particulars of t h is wei ghting 
are discussed in detail by iNewman et al.l (|2012f ). We 
briefly summarize it here. Galaxies in the EGS pho- 
tometric sample are divided according to the same color 
pre-selection as is used in the primary DEEP2 fields. All 
galaxies that pass this selection cut (i.e., the ones that 
would be observed in the rest of DEEP2), along with all 
galaxies brighter than R = 21.5, receive a uniform selec- 
tion probability po- For fainter galaxies that fail the se- 
lection cut in color space, the probability of spectroscopic 
observation is a function of flux, chosen so that roughly 
equal numbers of galaxies are observed above and below 
z = 0.75. This se l ection weighting is described in detail 
in INewman et al.l (|2012l ). 

We apply this same probabilistic selection weighting 
in the spectroscopic targeting algorithm we use in con- 
structing the EGS mocks, with the difference that the se- 
lection in color space is replaced with the selection in red- 
shift given in EquationHOl The EGS targeting algorithm 
also differs from the primary DEEP2 selection algorithm 
in its approach to tiling the observational area with slit- 
masks. Whereas the main DEEP2 survey strategy uses 
overlapping masks with a two-pass approach, the EGS 
targeting algorithm gives most galaxies four chances to 
be scheduled for observation, with one half of the masks 
oriented perpendicular to the other half, to further re- 
duce crowding effects. We use an identical algorithm to 
"schedule" our mock EGS galaxies for spectroscopic ob- 
servation. Then, after accounting for redshift failures as 
described in the previous section, we have a simulated 
DEEP2 EGS spectroscopic catalog that accounts for all 



0.50<z<0.75 




0.75<z<1.00 





1.00<z<1.30 




Figure 8. Comparison of the color-magnitude diagrams, in bins of 
redshift, for DEEP2 (upper row) and the mock catalogs presented 
here (lower rows). For the mock panels, black points show galaxies 
that were scheduled for observation and received a good redshift, 
while unobserved galaxies and redshift failures are shown in gray. 
The sources of the striping patterns visible in the mock panels are 
explained in the text. 

major color and luminosity-dependent selection effects. 

5. COMPARISONS TO DATA 

Our aim is to produce mock catalogs by populat- 
ing N-body models to reproduce various properties of 
the DEEP2 sample as accurately as possible. These 
include the luminosity function, luminosity-dependent 
two-point correlation function, color-magnitude diagram, 
and color-luminosity-environment relation — as well as 
the evolution of each of these with redshift. As discussed 
above, there is good reason to believe that reproducing 
these properties simultaneously will also correctly repro- 
duce the underlying relation between luminosity, color, 
and dark-matter halo mass, thereby allowing the DEEP2 
halo-mass selection function to be inferred. 

In order to gauge our success in reproducing DEEP2 
properties, it is obviously crucial to make direct compar- 
isons to the data. This will also help to reveal any limita- 
tions of our mocks. We have already made comparisons 
above between our mock catalogs and the DEEP2 lumi- 
nosity function (which our algorithm reproduces by con- 
struction; Figure and the projected two-point correla- 
tion function (Figure [1} for different luminosity thresh- 
olds. Because of the evolving color-dependent selection 
cut imposed by the DEEP2 magnitude limit, it is also 
important to test the color, environment, and redshift 
properties of the mocks against the data as well. We 
make these comparisons in the remainder of this section. 
Throughout, we will use the Bolshoi mock catalogs as our 
basis for comparison to the data, since its background 
cosmology is in the best agreement with current con- 
straints, but it will also be interesting to investigate the 
impact that different cosmological assumptions have in 
our three sets of mocks. We will conclude the section by 
briefly considering this issue. 

5.1. The color-magnitude diagram 

The first requirement for reproducing the DEEP2 
color-dependent selection effects in our mocks is that we 
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accurately reproduce the DEEP2 color distribution, its 
correlation with luminosity, and its redshift evolution. 
Figure [8] shows the rest-frame color-magnitude diagram 
for DEEP2 and the mock catalogs. For the mocks, both 
observed galaxies (black points) and unobserved galax- 
ies (gray points) are shown. The qualitative features of 
the DEEP2 color-magnitude distribution are broadly re- 
produced in the mocks: distinct red and blue sequences 
are evident at all redshifts, with the correct loci in color- 
magnitude space and the correct color-magnitude cor- 
relations for observed galaxies; and the color-dependent 
selection cut in the mocks matches the one in the data 
(apart from a small number faint outliers in the data, 
which arise from incorrect redshift assignments). 

Clear artifacts of our color- assignment process are evi- 
dent in the mock diagrams, however, in the form of strip- 
ing patterns. Two different kinds of stripe are appar- 
ent: horizontal stripes at bright magnitudes (especially 
at low redshift), and diagonal stripes for fainter objects. 
The latter stripes result from our method for assigning 
colors in luminosity bins where DEEP2 is incomplete, 
described in Section 13.3.21 In these bins, we have used 
the colors from brighter galaxies, shifted according to the 
mean color-magnitude relations for red and blue galaxies 
in DEEP2; the diagonal stripes for faint galaxies corre- 
spond to these relations. Horizontal striping occurs when 
the number of galaxies in a given DEEP2 luminosity- 
redshift bin is comparable to or smaller than the number 
of galaxies in the corresponding mock bin. This occurs 
especially at low redshift, since the ECS field has only 
half the area of the rest of the DEEP2 fields. For this 
reason, the low-redshift mock color-magnitude diagrams 
show a substantial striping and are extremely noisy. 

We have not shown the z < 0.5 diagram in Figured! 
since this source of noise is so great there as to make 
comparison to the data uninformative. Since the com- 
parison in the 0.5 < z < 0.75 bin is not especially helpful 
either, for the remainder of this section we will focus 
on comparing the mocks with the high-redshift DEEP2 
fields, excluding the ECS. If our techniques are success- 
ful at reproducing the high-z DEEP2 properties, then 
they should be equally successful in the ECS, within the 
confines of the limited data available in that region. 

To allow a more quantitative comparison of the color 
and magnitude distributions, we plot histograms of ap- 
parent magnitude, absolute magnitude, and rest-frame 
color, over two different redshift ranges, in Figure [9l 
To make direct comparisons between the data and the 
mocks, we choose a subset of the 40 Bolshoi light cones 
with equal area to that covered by the high-redshift 
DEEP2 fields; this constitutes a single mock realiza- 
tion of the DEEP2 sruvey. The figure shows histograms 
for the DEEP2 data (black solid lines) and for the ob- 
served galaxies in four different mock realizations (col- 
ored dashed lines); the four realizations give a sense of 
the scatter in these distributions. The agreement is quite 
good in general, although there is a small but systematic 
overabundance of faint, blue galaxies, especially at higher 
redshifts. 

There are two explanations for this. First, we have 
used the best fit to the evolving DEEP2 luminosity- 
function paramters to populate our mocks, rather than 
using the best fitting luminosity function at each red- 
shift. This means that the mock luminosity function may 
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Figure 9. Comparison of the apparent and absolute magnitude 
and rest-frame color distributions for the DEEP2 sample in the 
three high-redshift fields (black solid curves) and for four different 
mock realizations of identical volume to those fields (colored dashed 
curves). 

lie above or below the DEEP2 one in a given redshift 
bin. As shown in Fig 21 the input luminosity function 
(dotted curve) lies above the best-fitting DEEP2 lumi- 
nosity function in the 1.0 < z < 1.2 bin. This explains 
why the high-z absolute magnitude distributions for the 
mocks lie systematically above the DEEP2 distribution, 
for example. However, the excess appears to be more sig- 
nificant at fainter magnitudes, and this faint-end excess 
persists in the low-redshift bin, where the input luminos- 
ity function matches the DEEP2 best fit more closely. 
This is likely explained by inaccuracy the faint-end slope 
of the input luminosity function. W06 held this param- 
eter fixed at a value of 1.30, since DEEP2 does not ex- 
tend to faint enough magnitudes for a robust fit, and we 
have adopted this value here (Table [5]). Inspection of 
Figure 7 of W07 suggests that this fixed faint-end slope 
may slightly overestimate the faint-galaxy abundance in 
DEEP2 at z > 0.6, so it is not too surprising that the 
mocks show a slight excess of faint objects. This ex- 
cess translates into the systematic overabundance of blue 
galaxies that is also apparent in Figure |H1 since all of the 
faintest galaxies in DEEP2 are blue galaxies. 

5.2. The observed redshift distribution 

Another test of the color and magnitude distribution 
in the mocks is to compare the redshift distribution in 
the mocks to the DEEP2 distribution. Reproducing the 
redshift distribution is not trivial: since the DEEP2 mag- 
nitude limit is a strong function of redshift and rest- frame 
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Figure 10. Comparison of the redshift histograms for the DEEP2 
sample in the three high-redshift fields (black solid curves) and for 
four different mock realizations of identical volume to those fields 
(colored dashed curves). The mock realizations and their color 
coding are the same as was used in Fig [9] The panels show all ob- 
served galaxies (top), galaxies brighter than Mg = — 19 (middle), 
and galaxies brighter than Mb = — 20 (bottom). 

color, an accurate redshift distribution requires accuracy 
in the overall galaxy abundance, color distribution, color- 
luminosity relation, and the evolution of each of these 
with redshift. This means that the redshift distribution 
is a strong test of our mock-making techniques. Figure lTOl 
shows the redshift distribution for DEEP2 and for the 
same four mock realizations used above, for all observed 
galaxies, and for two different thresholds in luminosity. 
The agreement is quite good, considering the scatter be- 
tween the different mock realizations, except for a small 
but systematic overabundance of galaxies over the range 
1.05 < z < 1.3. As explained in the previous section, this 
excess can be explained by the difference between our 
input luminosity function and the true best-fit DEEP2 
luminosity function over this redshift range. It would be 
possible to improve the agreement here by using the true 
best-fit DEEP2 luminosity function in each redshift bin, 
but this would be at the expense of a smoothly evolving 
luminosity function and would likely create unphysical 
and undesirable jumps in the mock galaxy abundance at 
particular redshifts. We thus consider the current dis- 
tribution to be the best that can be achieved with the 
available data. 

5.3. The relation between color and environment 



As we discussed in Section 13. 3[ if we are going to re- 
liably infer the DEEP2 halo selection function, it is es- 
sential that our mock catalog accurately reproduce the 
relation between galaxy color and environment, as well 
as the redshift evolution of this relationship. We have 
attempted to ensure this by assigning colors to our mock 
galaxies as a function of the local galaxy overdensity. 
Our method for doing this relied on comparing density 
values measured in the magnitude-limited DEEP2 sam- 
ple to values measured in the volume-limited mock, and 
so we matched the colors in bins of relative overdensity, 
using the S3 parameter in the data and 67 in the mock. 
There is no obvious guarantee that this will translate 



to a mock that matches the absolute color-density rela- 
tion from DEEP2. It is important that we compare the 
color- environment relation in the observed mock with the 
DEEP2 data, to see how well the technique has worked. 

This comparison is shown in Figure [TT] The left panel 
shows the median third-nearest-neighbor galaxy overden- 
sity 83, in bins of rest-frame U—B color, over the redshift 
range 0.8 < z < 1.0. The black diamond poin t s con - 
stitute an update to the results of iCooper et al.l (2006) 
and show the DEEP2 color-density relation for the fi- 
nal DEEP 2 galaxy sample. The red triangles show the 
same quantities for the mock catalogs, where $3 has been 
computed using only the observed mock galaxies (i.e., 
those that were targeted for observation and received a 
valid redshift). The agreement between the two samples 
is remarkable, especially given the large number of se- 
lection effects (apparent-magnitude limit, spectroscopic 
targeting, and color-dependent redshift failure) that were 
applied to the mock catalog after the initial color assign- 
ment was done. This figure constitutes a strong confir- 
mation that our method for adding colors in quintiles of 
environment accurately reconstructs the DEEP2 color- 
environment relation. Although this figure shows only 
median values of #3 over a narrow redshift range, we 
have confirmed that the full £3 distributions have sim- 
ilar shapes and that similar results obtain for different 
redshift ranges. 

Because the DEEP2 color-en vironment relation also 
evolves strongly with redshift (jCooper et al.1 l2007h . it 
is also important that we capture this evolution in the 
mocks as well. The right-hand panel of the igure com- 
pares this evolution in the data and the mocks. The 
black curves are an update to Figure 6 of ICooper et al.1 
(2007): they show the fraction of galaxies that are red 
in the most overdense and most underdense 20% of the 
DEEP2 sample, in sliding bins of redshift with width 0.1 
(where red galaxies are defined by splitting the red and 
blue sequences as in W06). The red fraction of under- 
dense galaxies is roughly constant with z, while the over- 
dense red fraction evolves strongly, especially at z > 1, 
so that the two populations are nearly indistinguishable 
in terms of red fraction at z = 1.3. The red curves in 
the figure show the same quanities for the mocks; the 
same qualitative behavior is evident, and the agreement 
between the mocks and the data is quite good over the 
full redshift range considered. 

5.4. The Halo- occupation distribution and the impact of 

cosmology 

Before we conclude, it will be interesting to explore the 
importance of the background cosmology on the accuracy 
of the mock catalog. Recall that we have constructed 
DEEP2 mocks for theree different simulation boxes with 
three different cosmologies (listed in Table [lj . Through- 
out all of the above comparisons, we have considered 
only the Bolshoi simulation, since its background cos- 
mology is the most consistent with current cosmological 
constraints, and we have found that the Bolshoi mocks 
will be useful for probing the DEEP2 mass selection func- 
tion at high halo masses. We would like to know the other 
mocks are likely to be similarly useful or not. 

The most direct way to test this is to consider the 
HOD. Since the halo-occupation probability will feed di- 
rectly into the inferred halo selection function, if this 
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Figure 11. Comparison of the color-environment relation, and its redshift evolution, in DEEP2 and the mock catalogs. Left: The median 
value of the local overdensity parameter 83 is shown in bins of rest-frame color, for the DEEP2 data (black points) and the mock catalogs 
(red points). Error bars show the standard error, and points have been offset slightly in the horizontal direction for clarity. Here £3 in the 
mocks has been computed using only observed mock galaxies, as for DEEP2. Right: A comparison of the evolution of the color-density 
relation in the data and the mocks. Curves labeled overdense and underdense show the fraction of red galaxies for the most overdense 
and most underdense 20% of galaxies, respectively, where red galaxies are defined according to the red-blue division in W06. Black curves 
show the red fractions in DEEP2, and red curves show the values from the mock catalogs. 
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Figure 12. Halo occupation distributions for the three different 
mock cosmologies, with diff erent luminosity thresholds, c ompared 
to the analagous HODs that Zheng, Coil, & Zchavi (2007) inferred 
for DEEP2. A clear dependence of the HOD on background cos- 
mology (explained in detail in the text) is evident. The L120 and 
Bolshoi mocks agree reasonably well with the data in all but the 
brightest bin (where the inferred DEEP2 HOD, as discussed in the 
text, is highly uncertain and problematic). 

quantity depends strongly on cosmology it will be es- 
pecially important to make sure our simulation box has 
a background cosmology that is consistent with exist- 
ing constraints. If the dependence is weak, then this 
is less critical. We can compute the HOD for each of 
our mocks by dividing the dark matter halos into bins 



of mass and counting the mock galaxies in each halo 
above a given luminosity threshold, and then averaging 
these counts over the mass bin. It will also be inter- 
esting to compare the s e mo ck HODs to the HOD that 
IZheng. Coil, fc Zehavil (12001 (hereafter ZCZ07) inferred 
from the DEEP 2 clustering measurements. This will con- 
stitute another comparison of the mocks to the data, al- 
beit a model-dependent one, since the inferred DEEP2 
HOD makes assumptions about the background cosmol- 
ogy- 
Figure [T^] shows the comparison for the same four lu- 
minosity thresholds that were used in ZCZ07. There is 
a clear difference between the mock from the L160 sim- 
ulation box and the other two boxes, with subtler differ- 
ences between the L120 and Bolshoi mocks. The reason 
for these differences is easy to explain. The luminosity- 
assignment algorithm inserts a fixed luminosity function 
4>{L) 1 regardless of the underlying halo mass function. 
Thus, if the overall mass function has a lower normaliza- 
tion, we must place more galaxies in halos of a given mass 
to reproduce the desired luminosity function. Hence, a 
simulation with a lower normalization for its halo mass 
function (e.g., lower Qm or erg will necessarily have a 
higher mean halo occupation number {N)(M) at fixed 
4>(L). Thus, the L160 mock, with its relatively low value 
of erg has a significantly higher HOD at all masses than 
the other mocks. The differences between the L120 and 
Bolshoi HODs are smaller because their cosmologies (see 
Table [1} lie near a degenerate curve in — 08 space; 
that is, there is little difference betwen their mass func- 
tions. It is worth noting, though, that the Bolshoi mock 
HOD lies systematically above the L120 one in the mass 
range 10 12 < M < 10 , which is an important transi- 
tion region between the central-dominated and satellite- 
dominated parts of the HOD. Because there is a clear 
dependence of the HOD on cosmology in these mocks, we 
recommend that the Bolshoi mocks be used for nearly all 
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purposes, since the others may give inaccurate selection 
functions. However, we also make the L120 and L160 
mocks available to the public, e.g., for the purposes of 
testing cosmology dependence. 

The Bolshoi and L120 mocks also agree reasonably well 
with the HODs inferred from DEEP2, for all but the 
brightest galaxy sample. Black curves show the HOD 
models from ZCZ07 that best fiQ th e DEEP2 clustering 
measurements from lCoil et all ((006bl) . This agreement is 
not surprising, since ZCZ07 assumed a background cos- 
mology identical to the one used in the L120 simulation. 
For the three lower luminosity thresholds, the Bolshoi 
and L120 HODs lie slightly below the DEEP2 measure- 
ment at high mass, and they cut off somewhat less steeply 
at low mass. There is a strong degeneracy in the ZCZ07 
HOD model, however, such that increasing the sharpness 
of the low-mass cutoff can be offset by lowering the high- 
mass slope while keeping the predicted autocorrelation 
function roughly constant, so an even better agreement 
is likely possible within the model uncertainties. Since 
our zero-scatter model effectively requires a sharp low- 
mass cutoff, it is also possible that a model with some- 
what higher scatter would be in closer agreement with 
the ZCZ07 results. 

In the panel with the brightest luminosity thresh- 
old, the agreement with the inferred DEEP2 HOD is 
quite poor. However, the ZCZ07 fit in this bin is quite 
poorly constrained and depends strongly on the correc- 
tions that were applied to account for small-scale ef- 
fects from crowding on the DEEP 2 slitmasks. Further- 
more, the ZCZ07 curve shown there is unphysical (as 
ZCZ07 discuss) , since it formally implies that halos with 
M = 2 x 10 11 contain more galaxies with Mb < —20.5 
on average than they do galaxies with Mb < —20. We 
thus do not find the disagreement at bright magnitudes 
to be a particular cause for concern. 

6. SUMMARY AND CONCLUSIONS 

We have constructed three sets of mock catalogs for 
the DEEP2 Galaxy Redshift Survey, derived from three 
different iV-body simulations with different background 
cosmologies. In doing this, we have striven to reproduce, 
as accurately as possible, the DEEP2 HOD, as well as 
all of the important galaxy properties that might have 
an impact on DEEP2 galaxy selection. Meeting these 
goals required us to develop new empirical techniques 
to assign colors to our mock galaxies based purely on 
the color-environment relation in the DEEP2 data, since 
physically motivated galaxy formation models tend not 
to reproduce this relation accurately at high redshift. 
With the release of this paper, we also release the mock 
catalogs for use by the general public. Instructions for 
downloading the mock catalogs and detailed information 
on their contents can be found in the Appendix, along 
with a few specific examples of applications for which 
these mocks might be useful. 

Our mock-making technique assigns mock galaxies to 
the halos and subhalos in the dark-matter-only simula- 
tions by directly imposing the empirical properties of 

14 The one exception to this is the lowest luminosity bin. In 
that bin, the formal best fit lies in a region of parameter space that 
ZCZ07 excluded on physical grounds, so in this panel we show a 
curve that lies within the parameter uncertainties and that approx- 
imately reproduces the curve in Figure 1 of ZCZ07. 



DEEP2 on the simulations. We start by stacking sim- 
ulation snapshots from different timesteps to construct 
lightcones with the geometry of DEEP2 fields, which 
accurately reproduces the redshift evolution of cosmic 
structure. We then use the subhalo abundance matching 
technique to assign galaxy luminosities to the halos and 
subhalos in each lightcone. This technique posits a tight, 
monotonic relation between the mass of a halo and the 
luminosity of the galaxy it hosts (with the caveat that 
the masses of subhalos should be measured before they 
are accreted into their parent halos). 

To assign galaxy colors in such a way as to reproduce 
their strong dependence on luminosity, redshift, and lo- 
cal environment, we bin the DEEP2 and mock galaxies in 
luminosity and redshift and further subdivide these bins 
into quintiles of local nth-nearest-neighbor overdensity. 
We then draw DEEP2 galaxies at random from these bins 
and assign them to the mock galaxies in the correspond- 
ing bin, taking care to account for incompleteness in the 
DEEP2 sample at faint magnitudes. Having produced 
mock galaxies with redshifts, luminosities, and colors, 
we can invert the fc-correction technique of W06 to as- 
sign apparent i?-band magnitudes to the mock galaxies. 
These allow us to apply the DEEP2 apparent magnitude 
limit to our mock sample. We can also apply a redshift- 
dependent selection that closely approximates the effect 
of the DEEP2 selection in color-color space. Given these 
mock DEEP2 targets, we can apply the DEEP2 spectro- 
scopic target-selection algorithm to reproduce the spatial 
selection effects inherent to multiplexed slitmask spec- 
troscopy. Finally, to account for observations that fail 
to yield a redshift, we exclude a fraction of the mock 
galaxies according to the color and magnitude-dependent 
redshift-failure rate of DEEP2 galaxies. 

This mockmaking technique accounts for all major ob- 
servational effects that pertain to DEEP2 galaxies. The 
resulting mocks accurately reproduce a wide array of 
properties of the DEEP2 catalog, including the luminos- 
ity function; the color, magnitude, and redshift distribu- 
tions; the color-environment relation and its evolution; 
and the inferred DEEP2 HOD for massive halos, pro- 
vided that the background cosmology of the simulation 
is in agreement with current constraints. This accuracy 
gives us confidence that our DEEP2 mock catalogs will 
allow us to infer the selection of DEEP2 galaxies with 
reasonable accuracy as a function of dark-matter-halo 
mass, as well as the relations between halo mass and 
galaxy properties. 

In the process of constructing the mock catalogs, we 
obtained two results that are of general scientific interest 
beyond the area of mock-catalog creation. First, when 
assigning galaxy luminosities using abundance match- 
ing techniques, we tried various different assumptions 
about the scatter in the mass-luminosity relation and 
compared the resulting projected two-point correlation 
functions w p (r p ) to the DEEP2 measurement. We find 
that no model with fixed log-normal scatter (in luminos- 
ity at fixed mass) gives simulated w p (r p ) consistent with 
the clustering measured in DEEP2. This is broadl y con- 
sistent with the results of iWetzel fc White! (|2010f ). who 
require a large value of the scatter to achieve even ap- 
proximate agreement with the DEEP2 clustering results. 
For reasons discussed in Section 13.2.21 we do not believe 
that these issues will create problems when using these 
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mocks to optimize group-finding algorithms, although 
they should be used with care for purposes that involve 
understanding the selection of low-mass halos. 

In addition, because we have constructed mock cat- 
alogs from three separate N-body simulations with dif- 
ferent cosmological parameters, we can probe the cos- 
mology dependence of our mock-making techniques. We 
find that the HODs in the different mock catalogs vary 
strongly with cosmology. This arises directly from the 
technique for luminosity assignment. Because abundance 
matching maps the observed galaxy luminosity function 
to the simulated halo mass function at fixed number den- 
sity, if the mass function dn/dM in a given simuation has 
a lower normalization than the true mass function in the 
universe, then the HOD resulting from the galaxy assign- 
ment must have a higher normalization and lower mass 
cutoff to compensate. For cosmologies with nearly iden- 
tical mass functions (e.g., points lying along the same 
degenerate curve in Qm-cs space), however, the impact 
on the HOD will be relatively small. 

Thus, it is important when constructing mock cata- 
logs from iV-body models to use a simulation whose as- 
sumed cosmology is in agreement with all existing con- 
straints, to ensure the best possible accuracy in the re- 
sulting HOD. For most applications, we therefore rec- 
ommend the mock catalogs we have constructed from 
the Bolshoi simulation, since it is consistent with the 
best cosmological constraints available. The mocks con- 
structed from the L120 and L160 simulations are likely to 
be generally useful only for testing the impact of varying 
assumptions about cosmology on any conclusions drawn 
from the mocks. 

Given that the Bolshoi mocks should accurately repro- 
duce the DEEP2 halo selection function, an immediately 
interesting use to which we can put them is testing and 
calibrating an algorithm for detecting groups and clus- 
ters of galaxies. W e have used them for this purpose in 
IGerke et all (|20l2l) . 
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APPENDIX 

THE PUBLIC CATALOGS AND EXAMPLE APPLICATIONS 

With the publication of this paper, we also release our DEEP2 mock catalogs for use by the general public. Mocks 
for each of the three different simulation boxes listed in Table Q] can be obtained as binary data tables in FITS format 
from the World Wide Web at http://www.slac.stanford.edu/~risa/deepmocks. In this section we describe the 
contents of the electronic mock catalog files and give a few examples of applications for which they might be useful. 

The catalogs 

We provide two sets of forty lightcones from the Bolshoi simulation, each of which has the geometry of a single 
DEEP2 field, with an additional buffer region around the edge of the field on the sky. One set of lightcones is selected 
to reproduce the primary (high-redshift) DEEP2 fields, and the other is selected to replicate the EGS. The dark-matter 
halo populations of corresponding lightcones in the two sets of mocks are identical. For the L120 and L160 simulations, 
we provide two sets of eight lightcones from each box. The electronic mock catalogs are stored as binary data tables 
conforming to the FITS standard. They consist of a set of rows, one for each galaxy, with information stored in 
columns with the headings below. In all cases, a missing value is indicated by the value —999. 

• 0BJN0: A unique identification number for each object (galaxy or star). 

• RA: Right Ascnesion. 

• DEC: Declination. 

• Z: Redshift (including the peculiar- velocity Doppler shift). 

• MAGR: The apparent R-band magnitude mji. 

• ABSBMAG: The absolute B-band magnitude M B - 

• UB_0: The rest-frame U — B color. 

• ZQUALITY: Analagous to the DEEP2 redshift-quality flag Q, except that we do not use all of the the DEEP2 con- 
fidence codes from Q=l to 4. Galaxies that are assigned successful redshifts in our mock-observation algorithms 
have Q = 4, targeted galaxies that had redshift failures in our algorithm have Q = 1, stars have Q = — 1, and 
galaxies not targeted for spectroscopy have Q = — 2. 

• SELECTED: Galaxies that passed the redshift selection cut in Equation ITUl for the weighted redshift selection we 
apply in the EGS mocks) have this field set to unity. 

• MASKREGI0N Galaxies that fall in the region used for DEEP2 maskmaking, rather than the buffer region around 
the edge, have this field set to unity. Limiting to these galaxies and the ones flagged in the previous item is 
important for estimating the DEEP2 selection function in many cases. 

• PGAL Analagous to the DEEP2 star-galaxy separation probability. Mock galaxies have this field set to unity; 
stars have it set to zero. 

• XP0S, YP0S, and ZP0S The xyz position of this galaxy in the simulation box (in units of the box size). 

• VR The velocity of this galaxy along the line of sight (where positive velocity implies movement away from the 
observer). This can be combined with the Z value to compute the cosmological redshift. 

• GR0UPID A unique identifier for the distinct (parent) halo to which this galaxy belongs. 

• HALOID A unique identifier for the subhalo to which this galaxy belongs. 

• MASE0 The virial mass of this galaxy's dark- matter subhalo, as defined in IBrvan fc Norman (1998). 

15 This column is named M180 in the L120 and L160 mocks, . . . , , , .. . 1on .. , , , 

, . ... . . , ,. ' is that at which the subhalo density is 180 times the background 

where it gives the mass computed withm a sphere whose radius density 
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Figure 13. The selection function of mock DEEP2 galaxies versus subhalo mass, in bins of redshift. Solid lines show the fraction of 
subhalos that conain a galaxy that was included in the sample of observed mock galaxies with successful redshifts in the mocks. Dashed 
lines show the fraction that were scheduled for observation by the DEEP2 spectroscopic targeting algorithm. Dotted lines show the fraction 
that exceed the DEEP2 R = 24.1 apparent magnitude limit. 

• VMAX The maximum circular velocity of the subhalo (at the present time) . 

• GROUPMASS The mass of this galaxy's parent halo 

• CENTRAL Galaxies that are the central galaxies of their parent halos have this field set to unity. 

• SIGMA7_RAW Seventh-nearest neighbor surface density E7 for this galaxy, computed before applying DEEP2 
selection cuts. 

• DELTA7_RAW Seventh-nearest neighbor overdensity 67 = £7 / (Er(z)) 

• SIGMA3_DBS Third-nearest neighbor surface density S3 for this galaxy, computed after applying DEEP2 selection 
cuts. 

• DELTA3JDBS Third-nearest neighbor overdensity 5 3 = T, 3 /{E 3 (z)}. 

• ECOMDIST Comoving distance to the edge of the survey in comoving h" 1 Mpc. If this value is less than lh^ 1 
Mpc, the value of £3 for this galaxy is likely to be inaccurate. 

• EANGDIST Angular distance to the edge of the survey. (This column and the previous one are only defined for 
observed galaxies.) 

• WEIGHT The incompleteness weight w corr assigned to this galaxy. 

The mass dependence of DEEP2 selection 

One of our goals in producing mock catalogs was to understand the selection probability of DEEP2 galaxies as a 
function of their halo mass. As we discussed in Sections l3.2.21 the overall mass-selection function in the mocks is likely 
to have a low-mass cutoff that is sharper than is the case in the real universe, owing to the zero-scatter abundance 
matching model we used to populate the N-body model with galaxies; however, it is possible that other mass-dependent 
selection effects will dominate, making the precise shape of the HOD cutoff less relevant. At any rate, because various 
DEEP2 selection effects, including targeting and redshift-failurc effects, depend on local galaxy density, color, and 
apparent magnitude, one may well be concerned that the DEEP2 selection function might be a strong function of 
halo mass, beyond the cutoff at low mass from the magnitude limit. We can investigate this in detail with the mock 
catalogs produced in this study. 

Figure [131 shows the fraction of mock galaxies that passed various DEEP2 selection cuts, in three different redshift 
ranges, as a function of subhalo mass. Dotted lines show galaxies with R < 24.1, dashed lines show galaxies that were 
assigned to spectroscopic slits by the slitmask-making algorithm, and solid lines show galaxies that were successfully 
assigned redshifts. To produce these curves, we selected galaxies with SELECTED, MASKREGION, and PGAL values equal to 
unity (to exclude galaxies that had no probability of selection); then we computed the fraction that had MAGR < 24.1 
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and ZQUALITY greater than zero and equal to 4, respectively. The figure shows a broad mass cutoff in the mocks, 
spanning nearly an order of magnitude in mass at z ~ 1. The breadth of this transition occurs mostly because the 
DEEP2 flux limit translates into a strongly color-dependent luminosity cut, as shown in Figure [51 This transition is 
probably somewhat broader in reality, since the mock HOD cuts off too sharply at low masses; however, the dominant 
effect is the variable absolute magnitude limit, not the shape of the HOD, since the cutoff in the HOD (cf. Figure 
is sharper than the cutoff in mass shown in Figure 1131 

The mass cutoff also broadens further at higher redshift, such that the magnitude limit alone introduces some level 
of incompleteness all the way up to M ~ 10 1 , the mass scale of massive groups, at the highest redshifts in DEEP2. 
It is important to note, though, that this figure considers only the detection probability of the single galaxy at the 
center of each halo or subhalo. Massive group and cluster halos, containing multiple galaxies, might still have their 
satel lites detected, even if their centrals are not, so the selection function for groups will look somewhat different than 
this (jCerke et al.H2012h . 

The mass dependence of the DEEP2 targeting and redshift failures is much weaker than the magnitude selection. 
There is a slight drop in the spectroscopic targeting rate at high masses, particularly at high redshift. This can probably 
be attributed to the crowding of slits on DEIMOS slitmasks , which will be somewhat more severe for ga laxies in groups 
and clusters. However, as discussed in several other papers (jCooper et alJl2005t iGerke et al.ll2005l 12013 ) . such crowding 
effects on the sky do not necessarily translate into strong selection effects in three-space; hence, the impact on the 
mass-selection function here is quite weak. There is also a slight increase in the redshift-success rate with mass, 
corresponding to the drop in redshift success for the faintest apparent magnitudes. However, the main conclusion from 
this figure is that, for most applications, the targeting and redshift-failure rates in DEEP2 can safely be approximated 
as flat functions of mass. 



